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"Peptides" 

Summary of the invention 

This invention relates to peptides which are fragments of 
protein products arising from frameshift mutations in 
genes, which peptides elicit T cellular immunity, and to 
cancer vaccines and compositions for anticancer treatment 
comprising said peptides. 

The invention further relates to a method for identifying 
such peptides which are fragments of protein products 
arising from frameshift mutations in genes, which may 
elicit T cellular immunity which is useful for combating 
cancer associated with said mutated genes. 

The invention also relates to DNA sequences encoding at 
least one frameshift mutant peptide, and vectors 
comprising at least one insertion site containing a DNA 
sequence encoding at least one frameshift mutant 
peptide . 

Further the invention relates to methods for the 
treatment or prophylaxis of cancers associated with 
frameshift mutations in genes by administration of at 
least one frameshift mutant peptide or a recombinant 
virus vector comprising at least one insertion site 
containing a DNA sequence encoding at least one 
frameshift mutant peptide, or an isolated DNA sequence 
comprising a DNA sequence encoding at least one 
frameshift mutant peptide. 



The present invention represents a further development of 

anticancer treatment or prophylaxis based on the use of 
peptides to generate activation and strengthening of the 
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anti cancer activity of the T 
own immune system. 
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cellular arm of the body's 



5 Technical Background 

Tumour antigens, Status: 

T cell defined antigens have now been characterised in a 
0 broad spectrum of cancer types. These antigens can be 
divided into several main groups, depending on their 
expression. The two main groups are constituted by 
developmental differentiation related antigens 
(tumour- testis antigens, oncofoetal antigens etc., such 
5 as MAGE antigens and CEA) and tissue specific 

differentiation antigens (Tyrosinase, gplOO etc.). The 
group containing the truly tumour specific antigens 
contains proteins that are altered due to mutations in 
the genes encoding them. In the majority of these, the 
3 mutations are unique and have been detected in a single 
or in a small number of tumours. Several of these 
antigens seem to play a role in oncogenesis. 



Cancer vaccines, Status: 

25 

The focus in cancer vaccine development has been on 
antigens expressed in a high degree within one form of 
cancer (such as melanoma) or in many kinds of cancers. One 
reason for this is the increased recruitment of patients 
30 into clinical protocols. The field is in rapid growth, 

illustrated by the accompanying table listing the cancer 
vaccine protocols currently registered in 
the PDQ database of NCI. 



35 
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Inheritable cancer /cancer gene testing: 

Inherited forms of cancer occur at a certain frequency in 
the population. For several of these cancer forms, the 
underlying genetic defects have been mapped. This is also 
the case in Lynch syndrome cancers which constitute an 
important group of inheritable cancer. In families 
inflicted with this syndrome, family members inherit defect 
genes encoding DNA Mismatch Repair (MMR) Enzymes. Carriers 
of such MMR defects frequently develop colorectal cancer 
(HNPCC) and other forms of cancer (list?). Mutations in MMR 
enzymes can be detected using gene testing in the same way 
as other cancer related genes can be detected. 

Gene testing of risk groups in this case represents an 
ethical dilemma, since no acceptable forms for prophylactic 
treatment exist. At present surgery to remove the organ in 
danger to develop cancer has been the only treatment 
option. In these patients, cancer vaccines will be a very 
(interesting) form of prophylaxis, provided efficient 
vaccines can be developed. 

The lack of efficient repair of mismatched DNA results in 
deletions and insertions in one strand of DNA, and this 
happens preferentially in stretches of DNA containing 
repeated units (repeat sequences) . Until now, focus has 
been on repeat sequences in the form of non-coding 
microsattelite loci. Indeed microsattelite instability is 
the hallmark of cancers resulting from MMR defects. We 
have taken another approach, and have concentrated on 
frameshift mutations occurring in DNA sequences coding for 
proteins related to the oncogenic process. Such frameshift 
mutations result in completely new amino acid sequences 
in the C- terminal part of the proteins, prematurely 
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terminating where a novel stop codon appears. This results 
in two important consequences : 

1) The truncated protein resulting from the frameshift is 
generally nonfunctional, in most cases resulting in 
"knocking out" of an important cellular function. Aberrant 
proteins may also gain new functions such as the capacity 
to aggragate and form plaques. In both cases the frameshift 
results in disease. 



2) The short new C-terminal amino acid sequence resulting 
from the shift in the reading frame (the "frameshift 
sequence") is foreign Co the body. It does not exist prior 
to the mutation, and it only exists in cells having the 
mutation, i.e. in tumour cells and their pre malignant 
progenitors. Since they are completely novel and 
therefore foreign to the immune system of the carrier, they 
may be recognised by T-cells in the repertoire of the 
carrier . So far, nobody has focused on this aspect of 
frameshift mutations, and no reports exist on the 
characterisation of frameshift peptides from coding regions 
of proteins as tumour antigens. This concept is therefore 
novel and forms the basis for developing vaccines based on 
these sequences. It follows that such vaccines may also be 
used prophyllactively in persons who inherit defective 
enzymes belonging to the MMR machinery. Such vaccines will 
therefore fill an empty space in the therapeutic 
armament against inherited forms of cancer. 



It has been shown that single amino acid substitutions in 
intracellular " self " -proteins may give rise to tumour 
rejection antigens, consisting of peptides differing in 
their amino acid sequence from the normal peptide. The T 
cells which recognise these peptides in the context of 
the major histocompatibility (MHC) molecules on the 
surface of the tumour cells, are capable of killing the 
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tumour cells and thus rejecting the tumour from the host. 



In contrast to antibodies produced by the B cells, which 
typically recognise a free antigen in its native 
conformation and further potentially recognise almost any 
site exposed on the antigen surface, T cells recognise an 
antigen only if the antigen is bound and presented by a 
MHC molecule. Usually this binding will take place only 
after appropriate antigen processing, which comprises a 
proteolytic fragmentation of the protein, so that the 
resulting peptide fragment fits into the groove of the 
MHC molecule. Thereby T cells are enabled to also 
recognise peptides derived from intracellular proteins. T 
cells can thus recognise aberrant peptides derived from 
anywhere in the tumour cell, in the context of MHC 
molecules on the surface of the tumour cell, and can 
subsequently be activated to eliminate the tumour cell 
harbouring the aberrant peptide. 

M.Barinaga, Science, 257, 880-881, 1992 offers a short 
review of how MHC binds peptides. A more comprehensive 
explanation of the Technical Background for this 
Invention may be found in D. Male et al, Advanced 
Immunology, 1987, J . B. lippincott Company, Philadelphia. 
Both references are hereby included in their entirety. 

The MHC molecules in humans are normally referred to as 
HLA (human leukocyte antigen) molecules. They are encoded 
by the HLA region on the human chromosome No 6. 

The HLA molecules appear as two distinct classes 
depending on which region of the chromosome they are 
encoded by and which T cell subpopulations they interact 
with and thereby activate primarily. The class I 
molecules are encoded by the HLA A, B and C subloci and 
they primarily activate CD8+ cytotoxic T cells. The HLA 
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class II molecules are encoded by the DR, DP and. DQ 
subloci and primarily activate CD4+ T cells, both helper 
cells and cytotoxic cells. 

Normally every individual has six HLA Class I molecules, 
usually two from each of the three groups A, B and C. 
Correspondingly, all individuals have their own selection 
of HLA Class II molecules, again two from each of the 
three groups DP, DQ and DR. Each of the groups A, B, C 
and DP, DQ and DR are again divided into several 
subgroups. In some cases the number of different HLA 
Class I or II molecules is reduced due to the overlap of 
two HLA subgroups . 

All the gene products are highly polymorphic. Different 
individuals thus express distinct HLA molecules that 
differ from those of other individuals. This is the basis 
for the difficulties in finding HLA matched organ donors 
in transplantations. The significance of the genetic 
variation of the HLA molecules in immunobiology is 
reflected by their role as immune-response genes. Through 
their peptide binding capacity, the presence or absence 
of certain HLA molecules governs the capacity of an 
individual to respond to peptide epitopes. As a 
consequence, HLA molecules determine resistance or 
susceptibility to disease. 

T cells may control the development and growth of cancer 
by a variety of mechanisms. Cytotoxic T cells, both HLA 
class I restricted CD8+ and HLA Class II restricted CD4+, 
may directly kill tumour cells carrying the appropriate 
tumour antigens. CD4 + helper T cells are needed for 
cytotoxic CD8+ T cell responses as well as for antibody 
responses, and for inducing macrophage and LAK cell 
killing . 
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A requirement for both HLA class I and II binding is that 
the peptides must contain a binding motif, which usually 
is different for different HLA groups and subgroups. A 
binding motif is characterised by the requirement for 
amino acids of a certain type, for instance the ones 
carrying large and hydrophobic or positively charged side 
groups, in definite positions of the peptide so that a 
narrow fit with the pockets of the HLA binding groove is 
achieved. The result of this, taken together with the 
peptide length restriction of 8-10 amino acids within the 
binding groove, is that it is quite unlikely that a 
peptide binding to one type of HLA class I molecules will 
also bind to another type. Thus, for example, it may very 
well be that the peptide binding motif for the HLA-A1 and 
HLA-A2 subgroups, which both belong to the class I 
gender, are as different as the motifs for the HLA-A1 and 
HLA-Bl molecules. 



For the same reasons it is not likely that exactly the 
same sequence of amino acids will be located in the 
binding groove of the different class II molecules. In 
the case of HLA class II molecules the binding sequences 
of peptides may be longer, and it has been found that 
they usually contain from 10 to 16 amino acids, some of 
which, at one or both terminals, are not a part of the 
binding motif for the HLA groove. 

However, an overlap of the different peptide binding 
motifs of several HLA class I and class II molecules may 
occur. Peptides that have an overlap in the binding 
sequences for at least two different HLA molecules are 
said to contain "nested T cell epitopes". The various 
epitopes contained in a "nested epitope peptide" may be 
formed by processing of the peptide by antigen presenting 
cells and thereafter be presented to T cells bound to 
different HLA molecules. The individual variety of HLA 
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molecules in humans makes peptides containing nested 
epitopes more useful as general vaccines than peptides 
that are only capable of binding to one type of HLA 
molecule . 



Effective vaccination of an individual can only be 
achieved if at least one type of HLA class I and/or II 
molecule in the patient can bind a vaccine peptide either 
in it's full length or as processed and trimmed by the 
patient's own antigen presenting cells. 

The usefulness of a peptide as a general vaccine for the 
majority of the population increases with the number of 
different HLA molecules it can bind to, either in its 
full length or after processing by antigen presenting 
cells . 



In order to use peptides derived from a protein encoded 
by a mutated gene as vaccines or anticancer agents to 
generate anti tumour CD4+ and/or CD8+ T cells, it is 
necessary to investigate the mutant protein in question 
and identify peptides that are capable, eventually after 
processing to shorter peptides by the antigene presenting 
cells, to stimulate T cells. 



Prior art 



In our International Application PCT/NO92/00032 
(published as W092/14756), we described synthetic 
peptides and fragments of oncogene protein products which 
have a point of mutation or translocations as compared to 
their proto-oncogene or tumour suppressor gene protein. 
These peptides correspond to, completely cover or are 
fragments of the processed oncogene protein fragment or 
tumour suppressor gene fragment as presented by cancer 
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cells or other antigen presenting cells, and are 
presented as a HLA-peptide complex by at least one allele 
in every individual. These peptides were also shown to 
induce specific T cell responses to the actual oncogene 
protein fragment produced by the cell by processing and 
presented in the HLA molecule. In particular, we 
described peptides derived from the p21 ras protein which 
had point mutations at particular amino acid positions, 
namely position 12, 13 and 61. These peptides have been 
shown to be effective in regulating the growth of cancer 
cells in vitro. Furthermore, the peptides were shown to 
elicit CD4+ T cell immunity against cancer cells 
harbouring the mutated p21 ras oncogene protein through 
the administration of such peptides in vaccination or 
cancer therapy schemes. Later we have shown that these 
peptides also elicit CD8+ T cell immunity against cancer 
cells harbouring the mutated p21 ras oncogene protein 
through the administration mentioned above. 

However, the peptides described above will be useful only 
in certain number of cancers, namely those which involve 
oncogenes with point mutations or translocation in a 
proto-oncogene or tumour suppressor gene. There is 
therefore a strong need for an anticancer treatment or 
vaccine which will be effective against a more general 
range of cancers . 

In general, tumors are very heterogenous with respect to 
genetic alterations found in the tumour cells. This 
implies that both the potential therapeutic effect and 
prophylactic strength of a cancer vaccine will increase 
with the number of targets that the vaccine is able to 
elicit T cell immunity against. A multiple target vaccine 
will also reduce the risk of new tumour formation by 
treatment escape variants from the primary tumour. 
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Definition of Problem solved by the Invention, 



There is a continuing need for new anticancer agents based 
on antigenic peptides giving rise to specific T cellular 
responses and toxicity against tumours and cancer cells 
carrying genes with mutations related to cancer. The 
present invention will contribute largely to supply new 
peptides that can have a use in the combat and prevention 
of cancer as ingredients in a multiple target anti-cancer 
vaccine. 

Another problem solved by the present invention is that 
a protection or treatment can be offered to the 
individuals belonging to family's or groups with high 
risk for hereditary cancers. Hereditary cancers are in 
many cases associated with genes susceptible to 
frameshift mutations as described in this invention (i.e. 
mutations in mismatch repair genes) . Today it is possible 
to diagnose risk of getting hereditary cancer but no 
pharmaceutical method for protection against the onset of 
the cancer is available. 

Definition of the Invention 

A main object of the invention is to obtain peptides 
corresponding to peptide fragments of mutant proteins 
produced by cancer cells which can be used to stimulate T 
cells . 

Another main object of the invention is to develop a 
cancer therapy for cancers based on the T cell immunity 
which may be induced in patients by stimulating their T 
cells either in vivo or in vitro with the peptides 
according to the invention. 
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A third main object of the invention is to develop a 
vaccine to prevent the establishment of or to eradicate 
cancers based solely or partly on peptides corresponding 
to peptides of the present invention which can be used to 
generate and activate T cells which produce cytotoxic T 
cell immunity against cells harbouring the mutated genes. 

A fourth main object of the invention is to design an 
anticancer treatment or prophylaxis specifically adapted 
to a human individual in need of such treatment or 
prophylaxis, which comprises administering at least one 
peptide according to this invention. 

These and other objects of the invention are achieved by 
the attached claims. 

Since frameshift mutations result in premature stop 
codons and therefore deletion in large parts of the 
proteins, proteins with frameshift mutations have 
generally not been considered to be immunogenetic and 
have therefore not been considered as targets for 
immunotherapy. Thus it has now surprisingly been found 
that a whole group of new peptides resulting from 
frameshift mutations in tumour relevant genes are useful 
for eliciting T cell responses against cancer cells 
harbouring genes with such frameshift mutations. 

Genes containing a mono nucleoside base repeat sequence 
of at least five residues, for example of eighth 
deoxy adenosine bases (AAAAAAAA) , or a di -nucleoside base 
repeat sequence of at least four di -nucleoside base 
units, for example of two deoxyadenosine-deoxycytosine 
units (ACAC) , are susceptible to frameshift mutations. 
The frameshift mutations occur, respectively, either by 
insertion of one or two of the morio-nucleoside base 
residue or of one or two of the di-nucleoside base unit 
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in the repeat sequence, or by deletion of one or two of 
the mono-nucleoside base residue or of one or two of the 
di-nucleoside base unit from the repeat sequence. A gene 
with a frameshift mutation will from the point of 
mutation code for a protein with a new and totally 
different amino acid sequence as compared to the normal 
gene product. This mutant protein with the new amino acid 
sequence at the carboxy end will be specific for all 
cells carrying the modified gene. 

In the remainder of this specification and claims the 
denomination frameshift mutant peptides will comprise 
such proteins and peptide fragments thereof. 

It has now according to the present invention been found 
that such new protein sequences arising from frameshift 
mutations in genes in cancer cells give rise to tumour 
rejection antigens that are recognised by T cells in the 
context of HLA molecules. 

It has further according to the present invention been 
found a group of peptides corresponding to fragments of 
mutant proteins arising from frameshift mutations in 
genes in cancer cells which can be used to generate T 
cells. The said peptides can therefore also be used to 
rise a T cell activation against cancer cells harbouring 
a gene with a frameshift mutation as described above. 

These peptides are at least 8 amino acids long and 
correspond, either in their full length or after 
processing by antigen presenting cells, to the mutant 
gene products or fragments thereof produced by cancer 
cells in a human patient afflicted with cancer. 

A peptide according to this invention is characterised in 
that it 
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a) is at least 8 amino acids long and is a 
fragment of a mutant protein arising from a 
frameshift mutation in a gene of a cancer cell; 

and 

b) consists of at least one amino acid of the 
mutant part of a protein sequence encoded by said 
gene; 

and 

c) comprises 0-10 amino acids from the carboxyl 
terminus of the normal part of the protein 
sequence preceding the amino terminus of the 
mutant sequence and may further extend to the 
carboxyl terminus of the mutant part of the 
protein as determined by a new stop codon 
generated by the frameshift mutation in the gene; 

and 

d) induces, either in its full length or after 
processing by antigen presenting cell, T cell 
responses . 



The peptides of this invention contain preferably 8-2 5, 
9-20, 9-16, 8-12 or 20-25 amino acids. They may for 
instance contain 9, 12, 13, 16 or 21 amino acids. 



It is most preferred that the peptides of the present 
invention are at least 9 amino acids long, for instance 
9-18 amino acids long, but due to the processing 
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possibility of the antigen presenting cells also longer 
peptides are very suitable for the present invention. Thus 
the whole mutant amino acid sequence may be used as a 
frameshift mutant peptide according to the present 
invention, if it comprises 8 amino acids or more. 

The invention further relates to a method for vaccination 
of a person disposed for cancer, associated with a 
frameshift mutation in a gene, consisting of administering 
at least one peptide of the invention one or more times in 
an amount sufficient for induction of T-cell immunity to 
the mutant proteins encoded by the frameshift mutated gene. 

The invention also relates to a method for treatment of a 
patient afflicted with cancer associated with frameshift 
mutation in genes, consisting of administering at least one 
peptide of the invention one or more times in an amount 
sufficient for induction of T-cell immunity to mutant 
proteins arising from frameshift mutations in the genes of 
cancer cells. 

Furthermore, it has according to the present invention been 
found a method for identifying new peptides which 
correspond to fragments of proteins arising from frameshift 
mutations in genes. This method is characterised by the 
following steps: 

1) identifying a gene in a cancer cell susceptible 
to frameshift mutation by having a mono nucleoside 
base repeat sequence of at least five residues, or 
a di-nucleoside base repeat sequence of at least 
four di-nucleoside base units; 



and 
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2) removing, respectively, one nucleoside base 
residue or one di-nucleoside base unit from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/ or 

3) removing, respectively, two nucleoside base 
residues or two di -nucleoside base units from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/or 

4) inserting, respectively, one nucleoside base 
residue or one di-nucleoside base unit in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/ or 

5) inserting, respectively, two nucleoside base 
residues or two di-nucleoside base units in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon. 

In order to determine whether the peptides thus 
identified are useable in the compositions and methods 
according to the present invention for the treatment or 
prophylaxis of cancer, the following further step should 
be performed: 
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6) determining whether the new peptide, either in 
their full length or as shorter fragments of the 
peptides, are able to stimulate T cells. 

Optionally a further step may be added as follows: 

7) determining peptides containing nested epitopes 
for different major HLA class I and/or HLA class II 
molecules . 



Detail ed Description of the invention. 

In the present description and claims, the amino acids 
are represented by their one letter abbreviation as known 
in the art. 

The peptides of the present invention shall be explicitly 
exemplified through two different embodiments, wherein 
cancer develops based on frameshift mutations in specific 
genes, namely the BAX gene and TGFpRll gene: 

I) BAX gene 

It has been established that the BAX gene is involved in 
regulation of survival or death of cells by promoting 
apoptosis. The human BAX gene contains a repeat sequence 
of eight deoxyguanosine bases (G8) in the third exon, 
spanning codons 38 to 41 (ATG_GGG_GGG_GAG) . 

Frameshift mutations in this G8 repeat have been 
observed, both as G7 (ATG_GGG_jGGG AGG) and G9 (A TG GGG 
GGG GGA ) repeats, both in colon cancer cells and prostate 
cancer cells. The occurency is more than 50% of the 
examined cases (Rampino, N. et al . , "Somatic frameshift 
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mutations in the BAX gene in colon cancers of the 
microsatellite mutator phenotype.", Science (Washington 
DC), 275: 967-969, 1997). The modified BAX gene products 
are unable to promote apoptosis and thus makes further 
tumour progress possible. Furthermore the modified gene 
products are only found in cancer cells and are therefore 
targets for specific immunotherapy. 

According to the present invention, peptides 
corresponding to the transformed BAX protein products 
arising from frameshif t mutations in the BAX gene can be 
used as anticancer therapeutical agents or vaccines with 
the function to trigger the cellular arm of the immune 
system (T-cells) against cancer cells in patients 
afflicted with cancers associated with a modified BAX 
gene . 

Frameshift mutations in the BAX gene result in mutant 
peptide sequences with the first amino acid of the 
altered sequence in position 41 as compared to the normal 
BAX protein (Table 1, seq.id. no. 1 to 4). 

Table 1 

amino acid pos 41 51 61 71 

normal bax peptide ; EAPELALDPV PQDASTKKLS ECLKRIGDEL 
DS. . . 

seq . id . no . 1 ( bax- 1G ) ; RHPSWPWTRC LRMRPPRS 
seq.id.no. 4(bax+2G); GRHPSWPWTR CLRMRPPRS 

seq.id.no. 2(bax-2G); GTRAGPGPGA SGCVHQEAER VSQAHRGRTG Q 
seq.id.no. 3(bax+lG); GGTRAGPGPG ASGCVHQEAE RVSQAHRGRT GQ 
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Table 2 shows one group of peptides according to the 
present invention : 



Table 2 

seq.id.no. 5 

seq.id.no. 6 

seq.id.no. 7 

seq.id.no. 8 



IQDRAGRMGGRHPSWPWTRCLRMRPPRS 
I QDRAGRMGGGRHPSWPWT 

IQDRAGRMGGGGTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 
I QDRAGRMGGGTRAGPGPG 



The peptides listed in Table 3 were used for in vitro 
generation of T cells that recognise mutant BAX peptides, 



Table 3. 
seq id no 1 
seq id no 9 
seq id no 6 
seq id no 10 
seq id no 11 
seq id no 12 
seq id no 8 



RHPSWPWTRCLRMRPPRS 

I QDRAGRMGGRH P S WPWTRC LR 

IQDRAGRMGGGRHPSWPWT 

ASGCVHQEAERVSQAHRGRTGQ 

GGTRAG PG PGAS GC VHQEAERV 

I QDRAGRMGGGGTRAG PG PG AS 

I QDRAGRMGGGTRAG PG PG 



The most preferred peptides according to this embodiment 
of the present invention are listed in Table 4: 



Table - 


1 






seq 


id 


no 


1 


: RHPSWPWTRCLRMRPPRS 


seq 


id 


no 


2 


: GTRAG PG PG ASGC VHQEAERVS QAHRGRTGQ 


seq 


id 


no 


3 


: GGTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 


seq 


id 


no 


4 


: GRHPSWPWTRCLRMRPPRS 


seq. 


id 


.no. 


5 


: IQDRAGRMGGRHPSWPWTRCLRMRPPRS 


seq. 


id 


.no . 


6 


I QDRAGRMGGGRHPSWPWT 


seq. 


id 


.no . 


7 


IQDRAGRMGGGGTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 


seq 


id 


no 


8. 


I QDRAGRMGGGTRAGPGPG 


seq 


id 


no 


9 : 


I QDRAGRMGGRH P S WPWTRC LR 


seq 


id 


no 


10: 


ASGCVHQEAERVSQAHRGRTGQ 


seq 


id 


no 


11 : 


GGTRAG PG PGASGCVHQEAERV 
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seq id no 12 : I QDRAGRMGGGGTRAG PGPGAS 
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2) TGFpRII 

It has been established that the TGF(3RII gene is involved 
in regulation of cell growth. TGF|3rii is a receptor for 
TGF(3 which down regulates cell growth. The human gene 
coding for TGFpRII contains a repeat sequence of ten 
deoxyadenosine bases (A10) from base no. 709 to base no. 
718 (G AA AAA AAA AAG CCT) . In colon cancers and 
pancreatic cancers frameshift mutations in this A10 
repeat have been observed, both as A9 ( GAA AAA AAA AG C 
CT) and All (G AA AAA AAA AAA GCC) repeats, in 
approximately 80 % of the examined cases (Yamamoto, H., 
"Somatic frameshift mutations in DNA mismatch repair and 
proapoptosis genes in hereditary nonpolyposis colorectal 
cancer. Cancer Research 58, 997-1003, March 1, 1998). 
The modified TGFfiRII gene products are unable to bind 
TGF(3 and the signal for down regulation of cell growth 
is eliminated and thus makes further tumour progress 
possible. Furthermore the modified gene products are only 
found in cancer cells and are therefore targets for 
immunotherapy . 

Consequently peptides corresponding to the transformed 
TGFpRII protein products arising from frameshift 
mutations in the TGF(3RII gene can be used as anticancer 
therapeutical agents or vaccines with the function to 
trigger the cellular arm of the immune system (T-cells) 
against cancer cells in patients afflicted with cancers 
associated with a modified TGF(3rh gene. 

Frameshift mutations in the TGFfiRII gene result in mutant 
peptide sequences with the first amino acid of the 
altered sequence in either position 133 (one and two base 
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deletions) or 134 (one and two base insertions) as 
compared to the normal TGFpRII protein (Table 5, 
seq.id.nos. 13 and 21). 

Table 5. 

amino acid pos. 133 

normal TGF0RII ; K PGETFFMCSC SSDECNDNII FSEEYNTSNP 

DLLL 

seq id no 13(-1A); S LVRLSSCVPV ALMSAMTTSS SQKNITPAIL TCC 
seq id no 13(+2A); SLVRLSSCVP VALMS AMTT S SSQKNITPAI 

LTCC 

TGFbRII + 1A) ; AW 

TGFbRII - 2A) ; A W 

Table 6 shows one groups of peptides of this invention: 
Table 6 

seq id no 14 : SPKCIMKEKKSLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 
seq id no 15 : PKCIMKEKKKSLVRLSSCV 
seq id no 19 : SPKCIMKEKKAW 
seq id no 20 : PKCIMKEKKKAW 

Table 7 presents peptides that were used for in vitro 
generation of T cells that recognise mutant TGF(3RII 
peptides . 



Table ' 


7 






seq 


id 


no 


15: 


PKC IMKEKKKSLVRLS SCV 


seq 


id 


no 


16: 


ALMSAMTTS SSQKNITPAI LTCC 


seq 


id 


no 


17: 


SLVRLSSCVPVALMSAMTTSSSQ 


seq 


id 


no 


18: 


S PKCIMKEKKSLVRLS SCVPVA 


seq 


id 


no 


19: 


SPKCIMKEKKAW 


seq 


id 


no 


20: 


PKCIMKEKKKAW 


seq 


id 


no 


21: 


AMTTS S SQKN I T PAI LTCC 


seq 


id 


no 


428: 


SLVRLSSCV 
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The most preferred peptides of this embodiment of the 
present invention are: 

5 Table 8 

seq id no 13 : SLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 

seq id no 14 : SPKCIMKEKKSLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 

seq id no 15 : PKCIMKEKKKSLVRLSSCV 

seq id no 16 : ALMSAMTTSSSQKNITPAILTCC 
10 seq id no 17 : SLVRLSSCVPVALMSAMTTSSSQ 

seq id no 18 : SPKCIMKEKKSLVRLSSCVPVA 

seq id no 19 : SPKCIMKEKKAW 

seq id no 2 0 : PKC IMKEKKKAW 

seq id no 21 : AMTTSSSQKNITPAILTCC 
15 seq id no428 : SLVRLSSCV 

Other peptides of the invention can be fragments of the 
peptides listed in the Tables 1-8 above. Such fragments 
are most preferred from 9-16 amino acids long and include 
20 at least one amino acid from the mutant part of the 
protein. 

As used in this description and claims the term fragment 
is intended to specify a shorter part of a longer peptide 
25 or of a protein. 

Other cancer associated genes containing repeat sequences 
of a nucleoside base and which therefore are susceptible 
to frameshift mutations and consequently are potential 
30 candidates for peptides according to the present 

invention (seq id nos according to table 9 are given in 
parentheses in each case) are the following: 



35 



Human TGF-|3-2 (hTGF(32) gene (seq id nos 22-29) 
Deleted in colorectal cancer (DCC) gene (seq. id. nos 
30-34) 
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Human breast and ovarian cancer susceptibility (BRCA1) 
gene (seq.id.nos. 378-387) 

Human breast cancer susceptibility (BRCA2) gene 
(seq.id.nos. 35-94) 

Human protein tyrosine phosphatase (hPTP) gene 
(seq.id.nos. 95-102) 

Human DNA topoisomerase II (top2) gene (seq.id.nos. 
103-108) 

Human kinase (TTK) gene (seq.id.nos. 109-120) 

Human transcriptional repressor (CTCF) gene (seq.id.nos. 

121-127) 

Human FADD-homologous ICE/CED-3-like protease gene 
(seq.id.nos. 128-133) 

Human putative mismatch repair/binding protein (hMSH3) 
gene (seq.id.nos. 134-147) 

Human retinoblastoma binding protein 1 isoform I (hRBPl) 
gene (seq.id.nos. 148-156) 

Human FMR1 (hFMRl) gene (seq.id.nos. 157-161) 
Human TINUR gene (seq.id.nos. 162-169) 
b-raf oncogene (seq.id.nos. 170-175) 

Human neurof ibromin (NF1) gene (seq.id.nos. 176-181) 

Human germline n-myc gene (seq.id.nos. 182-188) 

Human n-myc gene (seq.id.nos. 189-194) 

Human ras inhibitor gene (seq.id.nos. 195-199) 

Human hMSH6 gene (seq.id.nos. 200-203 and 293-297) 

Human nasopharynx carcinoma EBV BNLF-1 gene (seq.id.nos. 

204-210) 

Human cell cycle regulatory protein (ElA-binding protein) 
p300 gene (seq.id.nos. 211-218) 

Human B-cell lymphoma 3-encoded protein (bcl-3) gene 
(seq.id.nos. 219-226) 

Human transforming growth factor-beta induced gene 

product (BIGH3) (seq.id.nos. 227-232) 

Human transcription factor ETV1 gene, (seq.id.nos. 

233-239) 
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Human insulin-like growth factor binding protein (IGFBP4) 

gene (seq.id.nos. 240-246) 

Human MUC1 gene (seq.id.nos. 247-266) 

Human protein-tyrosine kinase (JAK1) gene (seq.id.nos. 
267-271) 

Human protein-tyrosine kinase (JAK3) gene (seq.id.nos. 
272-279) 

Human Flt4 gene (for transmembrane tyrosinase kinase) 
(seq.id.nos. 280-284) 

Human p53 associated gene (seq.id.nos. 285-292) 

Human can (hCAN) gene (seq.id.nos. 298-300) 

Human DBL (hDBL) proto-oncogene / Human MCF2PO <hMCF2PO) 

gene (seq.id.nos. 301-306) 

Human dek (hDEK) gene (seq.id.nos. 307-309) 
Human retinoblastoma related protein (pl07) gene 
(seq.id.nos. 310-313) 

Human G protein-coupled receptor (hGPRl) gene 
(seq.id.nos. 314-319) 

Human putative RNA binding protein (hRBP56) gene 
(seq.id.nos. 320-325) 

Human transcription factor (hITF-2) gene (seq.id.nos. 
326-327) 

Human malignant melanoma metastasis-supressor (hKiSS-1) 
gene (seq.id.nos. 328-334) 

Human telomerase-associated protein TP-1 (hTP-1) gene 
(seq.id.nos. 335-348) 

Human FDF-5 (hFDF-5) gene (seq.id.nos. 349-356) 
Human metastasis-assosiated mtal (hMTAl) gene 
(seq.id.nos. 357-362) 

Human transcription factor TFIIB 90 kDa subunit 

(hTFIIB90) gene (seq id nos 363-369) 

Human tumour suppressor (hLUCA-1) gene (seq id nos 

370-377) 

Human Wilm's tumour (WIT-1) associated protein (seq id 
nos 388-393) 



10 
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Human cysteine protease (ICErel-IIl) gene (seq id nos 
394-398 and 459) 

Human Fas ligand (FasL) gene (seq id nos 399-403) 
Human BRCAl-associated RING domain protein (BARD1) gene 
(seq id nos 404-417) 

Human mcf.2 (hMCF.2) gene (seq id nos 418-422) 
Human Fas antigen (fas) gene (seq id nos 423-427) 
Human DPC4 gene (seq id nos 429-437) . 

The mutant peptides that are the results of frameshift 
mutation in these genes, in accordance with the present 
invention, are listed in table 9. 





Table 


9 






15 


seq 


id 


no 


22; 


TVGRPHISC 




seq 


id 


no 


23; 


KTVGRPHISC 




seq 


id 


no 


24; 


KQWEDPTSPANVT ALLOT 




seq 


id 


no 


25; 


QWEDPTSPANVIALLQT 




seq 


id 


no 


26; 


QKT I K STRKKTVGR PH I SC 


20 


seq 


id 


no 


27 • 






seq 


id 


no 


28; 


QKTIKSTRKKKQWEDPTSPANVIALLQT 




seq 


id 


no 


29; 


QKTIKSTRKKQWEDPTSPANVIALLQT 




seq 


id 


no 


30; 


AADLQQQFVHFLDCWDVSSIPFTLHLPQAQDITT 




seq 


id 


no 


31; 


GKDAKEKSS 


25 


seq 


id 


no 


32; 


GKDAKEKKSS 




seq 


id 


no 


33; 


GKDAKEKKAADLQQQFVHFLDCWDVSSIPFTLHLPQAQDITT 




seq 


id 


no 


34; 


GKDAKEKAADLQQQFVHFLDCWDVS S I PFTLHL PQAQDITT 




seq 


id 


no 


35; 


FSMKQTLMNVKNLKTK 




seq 


id 


no 


36; 


KFSMKQTLMNVKNLKTK 


30 


seq 


id 


no 


37; 


VRT SKTRKKF SMKQTLMNV3GMLKTK 




seq 


id 


no 


38; 


VRTS KTRKKKF SMKQTLMNVKNLKTK 




seq 


id 


no 


39; 


VRTSKTRKKNFP 




seq 


id 


no 


40; 


VRTSKTRKNFP 




seq 


id 


no 


41; 


IKKKLLQFQK 


35 


seq 


id 


no 


42; 


KIKKKLLQFQK 




seq 


id 


no 


43; 


KSRRNYFNFKNNCQSRL 
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sea 


id 


no 


44; 


SRRNYFNFKNNrn^PT, 




sea 


id 


no 


45; 


TNLRVIOKIKKKI/T.OFfjK 




seQ 


id 


nn 


46 • 






seq 


id 


no 


47 ■ 


TlMTi'R\7TnT<rT<r QPPMVPMPT^TvTM/^OCPT 
-LiMurv v iyAi\oi\xu\l x r IMP xSx\lx\IL*yo,Klj 


5 


seq 


id 


nn 


48 • 


TTvIT .PVTOIf QPPMVPMPTf KT7\Tr , f > lCPT 
J. i\j-i_jx\v x. ^r\ox\ rsx\i x r iMr IvLNJlNlt^yoKlj 




seQ 


id 


no 


49 • 


KTMTT 




seq 


id 


no 


50 ; 


NIDKTPFKTMTT 

X » -LJ— '1\_L xTXIi X\ X i v X X X 




seq 


id 


no 


51 • 


NTDKTPFKKTMTT 




seq 


id 


no 


52 • 


1 1 NAN 

-X. _L X\J xiXN 


10 


sea 


id 


no 


53 ; 


KIT NAN 




seq 


id 


no 


54 • 


NDKTV^FK T TNAN 




seq 


id 


no 


55 : 


NDKTV^FKK T TMAN 




seq 


id 


no 


56 • 


T . F P V T /MA 7M C\ T<T F 




seq 


id 


no 


57 • 


V 1 Oxj±Jxlj/Ax\X\JvjJXJXwI\.Ci X XjIYx Vl\iyj\Xj 


15 


seq 


id 


no 


5ft • 


^HT^T.T PAWMr*T PVPVT M\rw\r\VT? 






-L Li. 










seq 


id 


no 


60 ■ 


COT Q T .T .F Z\ "PTM 2\ 




seq 


id 




61 • 

ux. , 


TT.VFPPr 
l Jjvr Jris. 




seq 


id 


no 


62 ■ 


KTT.VFPK 


20 


seq 


id 


no 


63 • 


T,KNVFnfYK' r rT \7P PT^ 




seq 


id 

-X. \J. 


no 


64 • 


T ,TC "KT\ 7P HO V It T T 1 TP P T<T 
X-rxSXM VxjxJv^/JXxS. X XjVX* XT IS. 




seq 


id 


no 


65 • 


XjxNXM V xjx^^xxxvlx 




sea 


id 


no 


66 ; 


LKNVFDOKH 

J— IXVLM V J—J Ly^lvi 1 




seq 


id 


no 


67 • 


KK TOT.Y 

XXXV.X ^J_i X 


25 


seq 


id 


no 


68 • 


XXXXXA. -L V^XJ X 




seq 


id 


no 


69 • 


RKRF^ VTPVT.ZV QTTPPTPQ\7i\7PPT^ptotv7T ccpmttf t 
rsxxrs.r oi 1£jI Xj/\o X ±rvr ±r o VxNKKJ\xl*XyxNJijooL.x>ii* ivx. 






J- LI 


llKJ 




i-irC± Vox bx\AJ\x\xyij Y 




seq 


id 


no 


71; 


LRIVSYSKKKKKIQLY 




o ~ 


id 




79 • 


bKlVbi bx\i\Kx\Kr b Y I EYL.ASI IRFIFSVNRRKEIQNLS- 


30 














seq 


id 


no 


73 * 


T ,P T\7QVQTTPT^PPQV'T , T?VT A CTTDUTtrcTrMDDf t?tamt o 
xjn.j. voxox\i\xVKr ox IxLxijAolJLKr xr o VNRKKEIQNLiS- 

ov^iMr x\± 




seq 


ia 


no 


/4 ; 


QDLPLSSICQTIVTIYWQ 




seq 


id 


no 


75; 


KQDLPLSSICQTIVTIYWQ 


35 


seq 


id 


no 


76; 


NRTC PFRLFVRPvMLQFTGNKVLDRP 




seq 


id 


no 


77; 


GFWSWKKQDL PLSS I CQT I VT I YWQ 
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seq id no 78; GFWSWKKKQDLPLSSICQTIVTIYWQ 

seq id no 79; G F WS WKKNRTC PFRLFVRRMLQFTGNKVLDRP 

seq id no 80; GFWSWKNRTC PFRLFVRRMLQFTGNKVLDRP 

seq id no 81; YRKTKNQN 

seq id no 82; KYRKTKNQN 

seq id no 83; NTERPKIRTN 

seq id no 84; DETFYKGKK YRKTKNQN 

seq id no 85; DETFYKGKKKYRKTKNQN 

seq id no 86; DETFYKGKKNTERPKIRTN 

seq id no 87; DETFYKGKNTERPKIRTN 

seq id no 88; L S INNYRFQMKF YFRFTSHGS PFTSANF 

seq id no 89; KLSINNYRFQMKFYFRFTSHGSPFTSANF 

seq id no 90; NSVSTTTGFR 

seq id no 91; NI QLAATKKLS INNYRFQMKF YFRFTSHG S PFTSANF 
seq id no 92; NIQLAATKKKLSINNYRFQMKFYFRFTSHGSPFTSANF 
seq id no 93; NIQLAATKKNSVSTTTGFR 
seq id no 94; NIQLAATKNSVSTTTGFR 
seq id no 95; MEHVAPGRMSASPQSPTQ 
seq id no 96; KMEHVAPGRMSASPQSPTQ 

seq id no 97; KWSTWLQAECQHLHSPQRSDKPQQAGLDQQHHCFALDS- 

- SPGPRPVFLQLLGLMGQGRHD 
seq id no 98; WSTWLQAECQHLHSPQRSDKPQQAGLDQQHHCFALDS- 

- SPGPRPVFLQLLGLMGQGRHD 
seq id no 99; TFSVWAEKMEHVAPGRMSASPQSPTQ 
seq id no 100; TF SVWAEKKMEHVAPGRMS AS PQ S PTQ 
seq id no 101; TFSVWAEKKWSTWLQAECQHLHSPQRSDKPQQAGLDQ- 

- QHHCFALDS S PGPRPVFLQLLGLMGQGRHD 
seq id no 102; TFSVWAEKWSTWLQAECQHLHSPQRSDKPQQAGLDQ- 

- QHHCFALDS S PGPRPVFLQLLGLMGQGRHD 
seq id no 103; HKWLKFCLLRLVKESFHE 
seq id no 104; KHKWLKFCLLRLVKESFHE 
seq id no 105; KGGKAKGKKHKWLKFCLLRLVKESFHE 
seq id no 106; KGGKAKGKKKHKWLKFCLLRLVKESFHE 
seq id no 107; KGGKAKGKKNTNG 
seq id no 108; KGGKAKGKNTNG 
seq id no 109; VNNFFKKL 
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seq 


A A 
1Q 


no 




ttt 7"KTM T7 1 TP V XT' T 

KVNNr r KKL 


Seq 


A A 

10 


no 


in. 
Ill ; 


L b Q CjN vkk vnn f f kkl 


seq 


id 


no 


llz ; 


L S QGNVKKKVNNFFKKL 


seq 


id 


no 


■11*) 


GEKNDLQLFVMSDRRYKIYWTVILLNPCGNLHLKTTSL 


seq 


id 


no 


114; 


KGEKNDLQLFVMSDRRYKIYWTVILLNPCGNLHLKTTSL 


seq 


id 


no 


115 ; 


KGKKMICSYS 


seq 


id 


no 


116; 


GKKMICSYS 


seq 


a a 
id 


no 


117 ; 


SSKTFEKKGEKNDLQLFVMSDRRYKIYWTVILLNPCGN- 
-LHLKTTSL 


seq 


id 


no 


118 ; 


SSKTFEKKKGEKNDLQLFVMSDRRYKIYWTVILLNPCGN- 

t ttt it mm T 

-LHLKTTSL 


seq 


A A 


no 


1 1 y ; 


SSKTFEKKKGKKMICSYS 


seq 


A A 


no 


1/ U ; 


bbKTFEKKGKKMlCSYS 


seq 


A A 
lu 


no 


iz 1 ; 


QRKPKRANCVI QRRAKM 


seq 


A A 
lU 


no 


TOO. 


KQRK PKRANC VI QRRAKM 


seq 


A A 

lu 


no 


lz 3 ; 


NKENQKEQTALLYRGGQRCRCVCLRF 


seq 


A A 

id 


no 


lz 3 ; 


NKENQKEQTALLYRGGQRCRCVCLRF 


seq 


A A 

id 


no 


Iz4 ; 


PDYQ P P AKKQRK PKRANC VI QRRAKM 


seq 


id 


no 


125 ; 


PDYQPPAKKKQRKPKRANCVIQRRAKM 


seq 


id 


no 


12 6; 


PDYQPPAKKNKENQKEQTALLYRGGQRCRCVCLRF 


seq 


A A 

lu 


no 


T O "7 . 


PDYQPPAKNKENQKEQTALLYRGGQRCRCVCLRF 


seq 


A A 

lu 


no 


1 o o . 

Izo; 


NLSSLLI 


seq 


A A 

lu 


no 


1 O G . 

izy ; 


TCLPF 


seq 


A /I 

id 


no 


13 U ; 


Q PTFTLRKNL S S LL I 


seq 


A A 

id 


no 


1 O 1 

131 ; 


QPTFTLRKKNLSSLLI 


seq 


id 


no 


132 ; 


Q PTFTLRKKTC L PF 


seq 


A A 

lu 


no 


13 3; 


f\ nmr>mT t*^ T^m/~i t nn 

Q PTFTLRKTC L PF 


seq 


■i A 

lu 


no 


1 J 4 ; 


RATFLLSLWECSLPQARLCLIVSRTGLLVQS 


seq 


A A 

id 


no 


IOC. 

13 b; 


GQHFYWHCGSAACHRRGCV 


seq 


id 


no 


13 6; 


KENVRDKKRATFLL S LWEC S L PQARLCL I VSRTGLLVQS 


seq 


id 


no 


13 7 ; 


KENVRDKKKRATFLLSLWECSLPQARLCLIVSRTGLLVQS 


seq 


id 


no 


13 8 ; 


KENVRDKKKGQHFYWHCGSAACHRRGCV 


seq 


id 


no 


139; 


KENVRDKKGQHFYWHCGSAACHRRGCV 


seq 


id 


no 


140; 


ITHTRWG ITTWDSWSVRMKANWI QAQQNKSLI L S PS FTK 


seq 


id 


no 


141; 


KITHTRWGITTWDSWSVRMKANWIQAQQNKSLILSPSFTK 


seq 


id 


no 


142; 


KLLTPGGELPHGILGQ 
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seq id no 143; LLTPGGELPHGILGQ 

seq id no 144; PPVCELEKITHTRWGITTWDSWSVRMKANWIQAQQNKS- 

-LILSPSFTK 

seq id no 145; PPVCELEKKITHTRWGITTWDSWSVRMKANWIQAQQNKS- 

-LILSPSFTK 
seq id no 146; PPVCELEKKLLTPGGELPHGILGQ 
seq id no 147; P PVCELEKLLT PGGEL PHG ILGQ 
seq id no 148; SLKDELEKMKI 
seq id no 149; SLKDELEKKMKI 
seq id no 150; LGQS S PEKKNKN 
seq id no 151; LGQS S PEKNKN 
seq id no 152; RLRRINGRG SQ I RSRNAFNRSEE 
seq id no 153; EPKVKEEKKT 
seq id no 154; EPKVKEEKKKT 

seq id no 155; EPKVKEEKKRLRRINGRGSQIRSRNAFNRSEE 

seq id no 156; EPKVKEEKRLRRINGRGSQIRSRNAFNRSEE 

seq id no 157; TFRYKGKQHPFFST 

seq id no 158; GPNAPEEKNH 

seq id no 159; GPNAPEEKKNH 

seq id no 160; GPNAPEEKKTFRYKGKQHPFFST 

seq id no 161; GPNAPEEKTFRYKGKQHPFFST 

seq id no 162; MQNTCV 

seq id no 163; KMQNTCV 

seq id no 164; KCKIRVFSK 

seq id no 165; CKIRVFSK 

seq id no 166; FFKRTVQKMQNTCV 

seq id no 167; FFKRTVQKKMQNTCV 

seq id no 168; FFKRTVQKKCKIRVFSK 

seq id no 169; FFKRTVQKCK I RVF S K 

seq id no 170; LPHYLAH 

seq id no 171; CLITWLTN 

seq id no 172; GSTTGLSATPLPHYLAH 

seq id no 173; GSTTGLSATPPLPHYLAH 

seq id no 174; GSTTGLSATPPCLITWLTN 

seq id no 175; GSTTGLSATPCLITWLTN 

seq id no 176; RFADKPRPN 
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seq 


id 


no 


177 ; 


DLPTS PDOTR ICZ PVHV^VF P 

J-' -1—1 J. 4. k— ' X i— 'Ssl A -L V >*->\J IT V 11 V O V Xj X 




seq 


id 


no 


178; 


DSAAGCSGTPRFADKPRPN 




seq 


id 


no 


179; 


D S AAGC SGT P PRF ADK PR PN 




seq 


id 


no 


180; 


DSAAGCSGTPPDLPTSPDOTRSGPVHVSVEP 


5 


sea 


id 


no 


181 ; 


DSAAGCSGTPDLPTSPDOTRSGPVHVWFP 






i r\ 

_L \JL 


nu 




A T-T "DTT ,r rO ft PvMP T PTDP ODD 7"D C"D A P" 1 IV T"i DO A 0»0 CfM?D 
















seq 


id 


no 


"183 ; 


P AH PPT P A OMR T7R T PP Q P P F\7R Q p A'P V P DP a HP Q HTTP - 












-pptca cipnpnrnypTpp p 

XvOXVrAO ruivJJL U V x\ x v^ivx 


10 




id 


nu 


x. O *i / 


P PTPRHPRR T ZV QPQPa\7PPP 






■5 H 

1U 


n 0 


1 oc . 
10 J , 


V/ilKLsllFKFFAnFrj i FAyiNKLiKl FL bKKxWKbRACKP-- 












_ ppAnp^nFRRp^ziQPPRnpn^TPTPpD 




sea 


id 


no 


186 ; 


VAIRGHPRPPPAHPETPAONRLRIPC^RRFVR^RArKP 












- PGAQGSDERRGKASPGRDCDVRTGRP 


15 


seq 


id 


no 


187; 


VAIRGHPRPPRPTRRHPRRIASGSPAVGGR 






-L LI 


IlvJ 


1 ftp • 
loo, 


\ 7 A TDPUDDDD O r n'D'D "LTTDTDTD TA O O C 1 "P> 7\ T T/^» T5 

ViiXKUnr'Kr'Klr 1 KKHrKKlAbborA VCjGR 




seq 


ia 


no 


1 QQ . 

1 0 y ; 


KGRTbGRSLSCCRRPRCRPAVASRSTAPSPRAGSR- 












- KC LLrK 1 bL CjAARPRRTRS ACGDwV AS PPTRS S - 












_ CDTRPP A A CDD A "D OTaTO a T) 


?0 


o try 


J.U. 


no 


±y\j, 


PPPUT T7T7>T 7 








liU 


Xy ± , 


x r vjLrFJJb 1 FKLiKl bUKbijbLCrQ<FRCRFAVASR- 












- 0 1 AJ^b FKAL3bKKUL.ijK 1 bLC-rAAKPRRTRbACGD- 












-WVAbFFIKbbbKl ALGAAbPPARSWSAP 






1 rl 




JL y z , 


ir LrLyFiJbl FFKoKl bvaKbijbLLRRPRCRPAVASR- 


25 










— 9TAPQPR A^QRRPPT.RTQPPA 2i P DP P TP C A CCTWul 












- V A ^PPTP Q q qp nn Tipp a a c p p a t? ct a tq a n 




sea 


id 


no 


193 ; 


YFGGPDSTPPGGGHLERV 

J. JL WVJi ISkJ XXX \JVJ\Ji j JJXiXj V 








1 i\j 


± y *± , 


i r L?»oJrL>c> l FL3ovjrlxjxjxl* V 








no 


1QC . 


XJDT7 A T\"D 




seq 


ia 


no 


1 Q A • 


xjbybbbLiDPPbbR 




seq 


id 


no 


197 ; 


LSQSSELDPPPSSR 




seq 


id 


no 


iyy ; 


LSQSSELDPPHRVADP 




seq 


id 


no 


199; 


LSQSSELDPHRVADP 




seq 


id 


no 


200; 


VILLPEDTPPS 


35 


seq 


id 


no 


201; 


VILLPEDTPPPS 




seq 


id 


no 


202; 


VILLPEDTPPLLRA 
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seq id no 203; VILLPELDPLLRA 
seq id no 204; PSPLP 

seq id no 205; PLLFHRPCS PS PALGATVLAVYRYE 
seq id no 206; LLFHRPCS PS PALGATVLAVYRYE 
seq id no 207; APRPPLGPPSPLP 
seq id no 208; APRPPLGPPPSPLP 

seq id no 209; APRPPLGPPPLLFHRPCSPSPALGATVLAVYRYE 

seq id no 210; APRP PLG P PLLFHRPC S P S PALGATVLAVYRYE 

seq id no 211; TQVLPQGCSLSLLHTTFPHRQVPHILDW 

seq id no 212; PTQVL PQGC SL S LLHTTF PHRQVPH I LDW 

seq id no 213; PLQSFPKDAASAFSTPRFPTDKFPTSWTGSCPGQPHGT- 

- RAFCQPG PEFNAF SAC 
seq id no 214; LQSFPKDAASAFSTPRFPTDKFPTSWTGSCPGQPHGT- 

- RAFCQPG PEFNAF SAC 
seq id no 215; PSPRPQSQPPTQVLPQGCSLSLLHTTFPHRQVPHILDW 
seq id no 216; PSPRPQSQPPPTQVLPQGCSLSLLHTTFPHRQVPHILDW 
seq id no 217; PSPRPQSQPPPLQSFPKDAASAFSTPRFPTDKFPTS- 

-WTGSCPGQPHGTRAFCQPGPEFNAFSAC 
seq id no 218; PSPRPQSQPPLQSFPKDAASAFSTPRFPTDKFPTS- 

-WTGSCPGQPHGTRAFCQPGPEFNAFSAC 
seq id no 219; TAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 220; PTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 221; PRPGPAGGALLPRSLTAFVPHSGHGLPVSSGEPAYTPIP- 

-HDVPHGTPPFC 

seq id no 222; RPGPAGGALLPRSLTAFVPHSGHGL PVS SGEPAYTPI PH - 

-DVPHGTPPFC 

seq id no 223; DLPAVPGPPTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 224; DLPAVPGPPPTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 225; DLPAVPGPPPRPGPAGGALLPRSLTAFVPHSGHGLPVSSG- 

- EPAYTPI PHDVPHGTPPFC 
seq id no 226; DLPAVPGPPRPGPAGGALLPRSLTAFVPHSGHGLPVSSG- 

- EPAYTPI PHDVPHGTPPFC 
seq id no 227; QWGLSWMS 
seq id no 228; NGDCHGCPEGRQSL 
seq id no 229; FTMDRVLTPQWGLSWMS 
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seq id no 23 0; FTMDRVLTPPQWGLSWMS 

seq id no 231; FTMDRVLT P PNGDCHGC PEGRQS L 

seq id no 232; FTMDRVLT PNG DCHGC PEGRQ S L 

seq id no 233; HHPARQCPHCIMHLQTQLIHRNLTGPSQLTSLHRS- 

-PYQIAATPWTTDFAASFFLNPVTPFLLCRRCQGKDV- 

-LCTNARCLSQTSPSHHKALSRTTTQCMNT- 

- TPWLAVRPAKAFPLL 

seq id no 2 3 4 ; PHHPARQCPHC IMHLQTQLIHRNLTGPSQLTSLHRS - 

-PYQIAATPWTTDFAASFFLNPVTPFLLCRRCQGK- 
-DVLCTNARCLSQTSPSHHKALSRTTTQCMNTTP- 
-WLAVRPAKAFPLL 

seq id no 235; HTIQHASVPTASCISKLNSYTEN 

seq id no 23 6; PQVGMRPSNPPHHPARQCPHCIMHLQTQLIHRNLT- 

-GPSQLTSLHRSPYQIAATPWTTDFAASFFLNPVTPFL- 
- LC RRC QG KD VLC TN ARC L S QT S P S HH K AL S RTTTQC - 
-MNTTPWLAVRPAKAFPLL 

seq id no 237; PQVGMRPSNP P PHHPARQCPHC IMHLQTQLIHRNLTGPS- 

-QLTSLHRSPYQIAATPWTTDFAASFFLNPVTPFLLCRRC- 
-QGKDVLCTNARCLSQTSPSHHKALSRTTTQCMNTTPWLA- 
-VRPAKAFPLL 

seq id no 238; PQVGMRPSNPPHTIQHASVPTASCISKLNSYTEN 

seq id no 239; PQVGMRPSNPHTIQHASVPTASCISKLNSYTEN 

seq id no 240; WAARSWCERRAAAVAPLAPWAWGCPAGCTPPVAARAC - 

-AATRPEGWRSPCTH 
seq id no 241; PWAARS WC ERRAAAVA PLAPWAWGC P AGC T P PVAA - 

-RAC AATRPEGWRSPCTH 
seq id no 242; RGLRGAGARGGLRLLRHLRPGLGDALRGVHPPLR- 

-LGPALLPAPRGGEAPAHTDARARRVHGAGGDRGHPGPAAL 
seq id no 243; EEKLARCRPPWAARSWCERRAAAVAPLAPWAWGC PAGC - 

-TP PVAARAC AATRPEGWRS PCTH 
seq id no 244; EEKLARCRPPPWAARSWCERRAAAVAPLAPWAWGCPA- 

- GCT P PVAARAC AATRPEGWR S PCTH 

seq id no 245; EEKLARCRPPRGLRGAGARGGLRLLRHLRPGLGDA- 

- LRG VH P PLRLG PALL P APRGG E AP AHT D ARARRVHGAGG - 
- DRGHPGP AAL 
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seq id no 246; EEKLARCRPRGLRGAGARGGLRLLRHLRPGLGDALRG- 

- VH P PLRLG PALL PAPRGGE APAHTDARARRVHGAGG - 
-DRGHPGPAAL 

seq id no 247; QPPVSPRPRRPGRPRAPPPPQPMVSPRRRTTGPPW- 

-RPPPLQSTMSPPPQALHQAQLLLWCTTAPLPGLPQPQ- 
-PARALHSQFPATTLILLPPLPAIAPRLMPVALTIARYL- 
-LSPPPITALLPSCLLGSLSFSCLFTFQTSSLIPLW- 
- — KIPAPTTTKSCRETFLKW 

seq id no 248; SPGCHLGPGDQAAPGLHRPPSPWCHLGAGQQARLGVHR- 

-PSSPQCHLGLRLCIRLSFYSGAQRHLCQGYHNPSQQEHS- 
-ILNSQPPL 

seq id no 249; KPAPGSTAPQPPVSPRPRRPGRPRAPPPPQPMVSPRR- 

-RTTGPPWRPPPLQSTMSPPPQALHQAQLLLWCTTAP- 
-LPGLPQPQ PARALH SQF P ATTL I LL PPL PA I APRLMPVA - 
-LTIARYLLSPPPITALLPSCLLGSLSFSCLFTFQTS- 
-SLIPLWKIPAPTTTKSCRETFLKW 

seq id no 250; KPAPGSTAPPQPFVSPRPRRPGRPRAPPPPQPMVSPR- 

-RRTTGPPWRPPPLQSTMSPPPQALHQAQLLLWCT- 
-TAPLPGLPQPQPARALHSQFPATTLILLPPLPAIAP- 
-RLMPVALTIARYLLSPPPITALLPSCLLGSLSFSCLF- 
-TFQTSSLI PLWKI PAPTTTKSCRETFLKW 

seq id no 251; KPAPGSTAPPSPGCHLGPGDQAAPGLHRPPSPWCHL- 

-GAGQQARLGVHRPSSPQCHLGLRLCIRLSFYSGA- 
- QRHLCQGYHNP SQQEHS I LNSQPPL 

seq id no 252; KPAPGSTAPSPGCHLGPGDQAAPGLHRPPSPWCHL- 

- GAGQQARLGVHRP S S PQCHLGLRLC I RL SF Y SGAQ - 
-RHLCQGYHNPSQQEHSILNSQPPL 

seq id no 253; QPMVSPRRRTTGPPWRPPPLQSTMSPPPQALHQAQL- 

-LLWCTTAPLPGLPQPQPARALHSQFPATTLILLPPLP- 
-AIAPRLMPVALTIARYLLSPPPITALLPSCLLGSL- 
-SFSCLFTFQTSSLI PLWKI PAPTTTKSCRETFLKW 
seq id no 254; S PWCHLGAGQQARLGVHRP S S PQCHLGLRLC I RL SF - 

-YSGAQRHLCQGYHNPSQQEHSILNSQPPL 
seq id no 2 55; RPPPGSTAPQPMVSPRRR 
seq id no 256; RPPPGSTAPPQPMVSPRRR 
seq id no 2 57; RPPPGSTAPPSPWCHLGA 
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seq id no 258; RPPPGSTAPSPWCHLGA 
seg id no 259; RPRAPPPPSPWCHL 
seq id no 260; RPRAPPPPPSPWC 
seq id no 261; RPRAPPPPAHGVTSAP 
seq id no 262; RPRAPPPPPAHGV 
seq id no 263; APGLHRPPQPMVSP 
seq id no 264; AAPGLHRPQPMVSPR 
seq id- no 265; PGLHRPPPAHGVT 
seq id no 266; APGLHRPPAHGVTS 
seq id no 267; VDRPQHTEWLSWSNLYRIRHQ 
seq id no 268; HYLCTDVAPR 
seq id no 269; HYLCTDVAPPR 

seq id no 270; HYLCTDVAPPVDRPQHTEWLSWSNLYRIRHQ 

seq id no 271; HYLCTDVAPVDRPQHTEWLSWSNLYRIRHQ 
15 seq id no 272; SAYLSPLGTTWLRTCACRLPRPAASCLCTTPSLLW- 

- PRRTCPAGSPRATSSPWRMPAPKSCCTTGLAFTS - 
-PIGLGWRSATASGYARIWPVLSLTCQSWSTSLPSTAVTW 

seq id no 273; PSAYLSPLGTTWLRTCACRLPRPAASCLCTTPSLLWP- 

- RRTC PAG S PRATS S PWRMPAPKS CCTTGLAFTS P - 
20 - IGLGWRSATASGYARIWPVLSLTCQSWSTSLPSTAVTW 

seq id no 274; PAPIFLLWGPLG 

seq id no 275; APIFLLWGPLG 

seq id no 276; L PARAPGP P S AYLS PLGTTWLRTC ACRL PRPAASC L - 

-CTTPSLLWPRRTCPAGSPRATSSPWRMPAPKSCC- 
25 -TTGLAFTSPIGLGWRSATASGYARIWPVLSLT- 

-CQSWSTSLPSTAVTW 
seq id no 277; LPARAPGPPPSAYLSPLGTTWLRTCACRLPRPAAS- 

-CLCTTPSLLWPRRTCPAGSPRATSSPWRMPAPKSCC- 
-TTGLAFTSPIGLGWRSATASGYARIWPVLSLTC- 
30 -QSWSTSLPSTAVTW 

seq id no 278; LPARAPGPPPAPIFLLWGPLG 
seq id no 279; LPARAPGPPAPIFLLWGPLG 
seq id no 280; DLEHHGGVTRHRHR 
seq id no 281; LVSDYSMTPRP 
35 seq id no 282; LVSDYSMTPPRP 

seq id no 283; LVSDYSMTPPDLEHHGGVTRHRHR 
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seq id no 2 84; LVSDY^MTPDLEHHGGVTRHRHR 
seq id no 285; FHHIATDVGPFVRIGFLKIKGKIKGKSLRKPNW- 

- KTQHKLKRALMFL I VKKL 



seq id no 286; PFHHIATDVGPFVRIGFLKIKGKIKGKSLRKPNWK- 

-TQHKLKRALMFLIVKKL 
seq id no 287; PSITLQQMLAPS 
seq id no 298; SITLQQMLAPS 

seq id no 289; TSCNEMNPPFHHIATDVGPFVRIGFLKIKGKIKGKSL- 

-RKPNWKTQHKLKRALMFLIVKKL 
seq id no 290; TSCNEMNPPPFHHIATDVGPFVRIGFLKIKGKIKG- 

-KSLRKPNWKTQHKLKRALMFLIVKKL 
seq id no 291; TSCNEMNPPSITLQQMLAPS 
seq id no 292; TSCNEMNPPPSITLQQMLAPS 
seq id no 293; LEMILFLMTF 
seq id no 294; HPCITKTFLEMILFLMTF 
seq id no 295; HPC ITKTFFLEMILFLMTF 
seq id no 296; HPCITKTFFWR 
seq id no 297; HPCITKTFWR 
seq id no 298; LMFEHSQMRLNSKNAHLPIISF 
seq id no 299; EYGS I I AFLMFEHSQMRLNSKNAHLPI I SF 
seq id no 300; E YG S 1 1 AFFLMFEH SQMRLNSKNAHL P 1 1 SF 
seq id no 301; HLNKGRRLGDKIRAT 
seq id no 302; FHLNKGRRLGDKIRAT 
seq id no 303; VTSGTPFFHLNKGRRLGDKIRAT 
seq id no 304; VTSGTPFFFHLNKGRRLGDKIRAT 
seq id no 305; VTSGTPFFFI 
seq id no 306; VTSGTPFFI 
seq id no 307; CEIERIHFFF 
seq id no 308; CEIERIHFFSK 
seq id no 309; CEIERIHFSK 
seq id no 310; FRYISKSI 
seq id no 311; RYISKSI 
seq id no 312; FKKYEPIFFRYISKSI 
seq id no 313; FKKYEPIFRYISKSI 
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FPDSDQPGPLYPLDPSCLISSASNPQELSDCHYIH- 
-LAFGFSNWRSCPVLPGHCGVQ 

PDSDQPGPLYPLDPSCLISSASNPQELSDCHYIHL- 

-AFGFSNWRSCPVLPGHCGVQ 

LNMFASVFS 

LNMFASVFFS 

LNMFASVFFPDSDQPGPLYPLDPSCLISSASNPQE- 

-LSDCHYIHLAFGFSNWRSCPVLPGHCGVQ 

LNMFASVFPDSDQPGPLYPLDPSCLISSASNPQELS- 

- DCHY IHLAFGF SNWR SC PVL PGHCGVQ 

AMEETVWAVATVETEVEAMEETGWAAMEETEVGAT- 

-EETEVAMEAKWEEETTTEMISATDHT 

LWVRPWLWEWLRWRPKWRLWRRQEWWRLWRRPRWGL- 

-RRRPRWLWRENGRKKRLQK 

YGGDRSRGAMEETVWAVATVETEVEAMEETGWAAM- 
- EETEVGATEETEVAMEAKWEEETTTEMI SATDHT 
YGGDRSRGGAMEETVWAVATVETEVEAMEETGWA- 
-AMEETEVGATEETEVAMEAKWEEETTTEMI SATDHT 
YGGDRSRGGLWVRPWLWEWLRWEPKWRLWRRQEWW- 
- RLWRRPRWGLRRRPRWLWRENGRKKRLQK 
YGGDRSRGLWVRPWLWEWLRWE PKWRLWRRQEWWR - 
- LWRRPRWGLRRRPRWLWRENGRKKRLQK 
EFGGGRRQK 
EFGGRRQK 
RRAKGGGAGASNPRQ 
GRRAKGGGAGASNPRQ 
DVGLREGALELPTRGNKRNVA 
MRGGGGVGGRRAKGGGAGASNPRQ 
MRGGGGVGGGRRAKGGGAGASNPRQ 
MRGGGGVGGDVGLREGALELPTRGNKRNVA 
MRGGGGVGDVGLREGALEL PTRGNKRNVA 
VWQLAGPMLAGWRSLGSWFCRMYGI 
CGSWPALCWRAGGVWAVGSAGCMEYDPEALPAAWGP- 
- AAAATVH PRR 

RRY PC EWG VWQL AG PML AGWR S LG SWFC RM YG I 
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seq id no 338; RRYPCEWGGVWQLAGPMLAGWRSLGSWFCRMYGI 
seq id no 339; RRY PCEWGGCG S WPALCWRAGG VWAVG S AGCME YD - 

-EALPAAWGPAAAATVHPRR 
seq id no 340; RRYPCEWGCGSWPALCWRAGGVWAVGSAGCMEYDPE- 

-ALPAAWGPAAAATVHPRR 

seq id no 341; LWLWAGWTVWWSCGPGEKGHGWPSLPTMALLLLRFSCM- 

-RVASY 

seq id no 342; GLWLWAGWTVWWS CGPGEKGHGWPS L PTMALLLL - 

-RFSCMRVASY 

seq id no 343; GCGCGPAGQYGGAVGLARRGTAGCLPCPPWLCCCCAF- 

-PACGLPGTDGWRGWQGSGCVRVSGSAPWAPGFPFSP- 
-PCPLCGTQPRW 

seq id no 344; CGCGPAGQYGGAVGLARRGTAGCLPCPPWLCCCCAFPACG- 

-LPGTDGWRGWQGSGCVRVSGSAPWAPGFPFSPPC- 
15 -PLCGTQPRW 

seq id no 345; LAFNVPGGLWLWAGWTVWWSCGPGEKGHGWPSLPTMA- 

-LLLLRFSCMRVASY 
seq id no 346; LAFNVPGGGLWLWAGWTVWWSCGPGEKGHGWPSLPTM- 

-ALLLLRFSCMRVASY 
20 seq id no 347; LAFNVPGGGCGCGPAGQYGGAVGLARRGTAGCLPCPP- 

-WLCCCCAFPACGLPGTDGWRGWQGSGCVRVSGSAPW- 

-APGFPFSPPCPLCGTQPRW 
seq id no 348; LAFNVPGGCGCGPAGQYGGAVGLARRGTAGCLPCPPW- 

-LCCCCAFPACGLPGTDGWRGWQGSGCVRVSGSAPWA- 
25 -PGFPFSPPCPLCGTQPRW 
seq id no 349; PPMPMPGQREAPGRQEA 
seq id no 350; GPPMPMPGQREAPGRQEA 
seq id no 351; GHQCQCQGKGRHRADRRPDTAQEE 
seq id no 352; HQCQCQGKGRHRADRRPDTAQEE 
30 seq id no 353; GGHSYGGGPPMPMPGQREAPGRQEA 
seq id no 354; GGHSYGGGG PPMPMPGQREAPGRQEA 
seq id no 355; GGHSYGGGGHQCQCQGKGRHRADRRPDTAQEE 
seq id no 356; GGHSYGGGHQCQCQGKGRHRADRRPDTAQEE 
seq id no 357; APCPQSSGGG 
35 seq id no 358; LPAPSQAAADELDRRPG 
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seq id no 3 88; NRGWVGAGE 

seq id no 389; IEAG 

seq id no 390; VHNYCNMKNRGWVGAGE 

seq id no 391; VHNYCNMKKNRGWVGAGE 

seq id no 392; VHN YCNMKK I E AG 

seq id no 393; VHNYCNMKIEAG 

seq id no 394; QLRCWNTWAKMFFMVFLI IWQNTMF 

seq id no 395; VKKDNHKKQLRCWNTWAKMFFMVFLI IWQNTMF 

seq id no 396; VKKDNHKKKQLRCWNTWAKMFFMVFLI IWQNTMF 

seq id no 397; VKKDNHKKKNS 

seq id no 398; VKKDNHKKNS 

seq id no 399; GAE E SGPFNRQVQLKVHASGMGRHLWNC PAFWS EV 
seq id no 400; HPSPPPEKRS 
seq id no 401; HPSPPPEKKRS 

seq id no 402; HPSPPPEKKGAEESGPFNRQVQLKVHASGMGRHLW- 

-NCPAFWSEV 

seq id no 403; HPSPPPEKGAEESGPFNRQVQLKVHASGMGRHLWN- 

-CPAFWSEV 

seq id no 404; MQVLSKTHMNLFPQVLLQMFLRGLKRLLQDLEKSKKRKL 
seq id no 405; RCKSARLI 

seq id no 406; VQTQPAIKKMQVLSKTHMNLFPQVLLQMFLRGLKRLLQ- 

-DLEKSKKRKL 

seq id no 407; VQTQPAIKKKMQVLSKTHMNLFPQVLLQMFLRGLKRL- 

-LQDLEKSKKRKL 
seq id no 408; VQTQ PA I KKRCKS ARLI 
seq id no 409; VQTQ PA I KRC KS ARL I 
seq id no 410; ARSGKKQKRKL 
seq id no 411; ARSGKKQKKRKL 
seq id no 412; ARSGKKQKKENFS 
seq id no 413; ARSGKKQKENFS 
seq id no 414; KASARSGKSKKRKL 
seq id no 415; KASARSGKKSKKRKL 
seq id no 416; KASARSGKKAKKENSF 
seq id no 417; KASARSGKAKKENSF 
seq id no 418; HLNKGRRLGDK I RAT 
seq id no 419; VTSGTPFFHLNKGRRLGDKI RAT 
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seq id no 420; VTSGTPFFFHLNKGRRLGDKIRAT 
seq id no 421; VTSGTPFFFI 
seq id no 422; VTSGTPFFI 

seq id no 423 ; VTLLYVNTVTLAPNVNMESSRNAHSPATPSAKRK- 

- DPDLTWGGFVFFFCQFH 
seq id no 424; KCRCK PNFF VTLL YVNTVTL A PNVNME S S RNAH S P - 

-ATPSAKRKDPDLTWGGFVFFFCQFH 
seq id no 425; KCRCKPNFFF VTLLYWTVTLAPNVNMES SRNAH - 

- SPATPSAKRKDPDLTWGGFVFFFCQFH 
seq id no 426; KCRCKPNFFL 
seq id no 427; KCRCKPNFL 
seq id no 429; LVKKLKEKKMNWIL 
seq id no 430; LVKKLKEKKKMNWIL 
seq id no 431; LVKKLKEKKR 
seq id no 43 2; LVKKLKEKR 
seq id no 433; AAIVKDCCR 
seq id no 434; SQPASILGRKL 
seq id no 4 35; SQPAS I LGKRKL 
seq id no 436; SQPAS ILGKAAIVKDCCR 
seq id no 437; SQPAS I LGAAI VKDCCR 
seq id no 459; NTWAKMFFMVFLI I WQNTMF 

Examples of cancers particularly suitable for treatment 
with one or a combination of several of this compounds 
are: colorectal cancer, breast cancer, small-cell lung 
cancer, non small-cell lung cancer, liver cancer (primary 
and secondary), renal cancer, melanoma, ovarian cancer, 
cancer of the brain, head and neck cancer, pancreatic 
cancer, gastric cancer, eosophageal cancer, prostate 
cancer and leukemias and lymphomas. 

Below are listed some examples of where these mutations 
may result in gene products that result in development of 
tumours : 
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Development of colorectal cancers are believed to result 
from a series of genetic alterations. Deleted in 
colorectal cancer (DCC) gene (seq id nos 30-34), Human 
cysteine protease (ICErel-Ill) gene (seq id nos 394-398 
and 459), Human putative mismatch repair /binding protein 
(hMSH3) gene (Seq id hos 134-147), Human hMSH6 gene (seq 
id nos 201-204 and 295-299), Human n-myc gene (seq id nos 
190-195), Human TGF02 (hTGFp2) gene (seq id nos 22-29), 
Human p53 associated gene (seq id nos 287-294) may be 
involved in colorectal cancer. 

Human breast cancer susceptibility ( BRCA2 ) (seq id nos 
35-94) and Human BRCAl-associated RING domain protein 
( BARD1 ) gene (seq id nos 404-413) are involved in breast 
cancer and ovarian cancer Human hMSH6 gene (seq id nos 
201-204 and 295-299) may be involved in brain tumours. 

Gene alteration are frequent in many types of 
adenocarcinomas , below are listed some genes that are 
mutated in many cancers : 

Human breast cancer susceptibility (BRCA2) gene (seq id 
nos 35-94) , Deleted in colorectal cancer (DCC) gene (seq id 
nos 30-34), Human putatative mismatch repair /binding 
protein (hMSH3) gene (seq id nos 134-147), Human hMSH6 
gene (seq id nos 201-204 and 295-299), human N-MYC gene 
(seq id no 190-195), Human TGFb2 (hTGFb2) gene (seq id nos 
22-29), Human p53 associated gene (seq id nos 287-294), 
Human MUC1 gene (seq id nos 248-267), Human germline n-myc 
gene (seq id nos 184-195), Human Wilm's tumour (WIT-1) 
associated protein (seq id nos 388-393), Human nasopharynx 
carcinoma EBV BNLF-1 gene (seq id nos 205-211), Human 
transforming growth factor-beta inducted gene product 
(BIGH3) seq id nos 228-233). 
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Many of the mutated genes may result in development of 
leukemias and lymphomas: Human neurof ibromin (NF1) gene 
(seq id nos 178-183), b-raf oncogene (seq id nos 172-177), 
Human protein- tyrosine kinase (JAK1) gene (seq id nos 
268-272), Human protein- tyrosine kinase (JAK3) gene (seq 
id nos 273-280) are examples. 

Genes involved in malignant melanoma : Human malignant 
melanoma metastasis-supressor (hKiSS-1) gene (seq id nos 
331-337), Genes involved in metastasis: Human 
metastasis-assosiated mtal (hMTAl) gene (seq id nos 
360-365) . 

Cell cycle control and signal transduction is strikcly 
regulated. Frameshift mutations in these genes may result 
in uncontrolled cell growth. Examples of genes which may 
be suseptable are: Human protein tyrosine phosphatase 
(hPTP) gene (seq id nos 95-102), Human kinase (TTK) gene 
(seq id nos 109-121), Human transcriptional repressor 
(CTCF) gene (seq id nos 122-128), Human cell cycle 
regulatory protein (ElA-binding protein) p3 00 gene (seq id 
nos 212-219) , Human tranforming growth factor-beta 
inducted gene product (BIGH3) (seq id nos 228-233), Human 
FLt4 gene (for transmembrane tyrosinase kinase (seq id nos 
281-286), Human G protein-coupled receptor (hGPRl) gene 
(seq id nos 317-322), Human transcription factor (hITF-2) 
gene (seq id nos 32 9-330) , Human telomerase-associated 
protein TP-1 (hTP-1) gene (seq id nos 338-351), Human 
transcription factor TFIIB 90 kDa subunit (hTFJ3IIB90) gene 
(seq id nos 366-373), Human FADD- homologous ICE/CED-31ike 
protease gene (seq id nos 129-133) 

Mutations in DNA synthesis or -repair enzymes may also 
lead to uncontrolled cell growth. Human DNA topoisomerase 
II (top2) gene (seq id nos 103-108) and Human putative 
mismatch repair /binding protein (hMSH3) gene (seq id nos 
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134-147) and (hMSH6) gene (seq id nos 201-204 and 
205-299) . 

The following are tumour suppressor genes, Human 
retinoblastoma binding protein 1 isoform I (hRBPl) gene 
(seq id hos 148-158), Human neurof ibromin (NF1 ) gene (seq 
id nos 178-183), Human p53 associated gene (seq id nos 
287-294), Human retinoblastoma related protein (pl07) gene 
(seq id nos 312-316), Human tumour suppressor (hLUCA-1) 
gene (seq id nos 374-381), Mutations in these genes may 
result in development of cancer. 

The following are oncogenes, proto-oncogenes or putative 
oncogenes ; Human germline n-myc gene (seq id nos 
184-189), Human n-myc gene (seq id nos 190-195), Human can 
(hCAN) gene (seq id nos 300-302), Human dek (hDEK) gene 
(seq id nos 309-311), b-raf oncogene (seq id nos 172-177), 
Human DBL (hDBL) proto-oncogene / Human MCF2PO (hMCF2PO) 
gene (seq id nos 303-308). Frameshift mutations in these 
genes may lead to development of cancer. 



BIOLOGICAL EXPERIMENTS 



Description of the Fig ures 
FIG. 1: 

It has been demonstrated that T cells from normal donors 
can be stimulated with a mixture of peptides containing 
both mutant BAX and mutant TGF(5rii peptides. Peptide 
mixture dependent T cell proliferation in blood samples 
from six different donors are shown in figure 1. The 
results were obtained by stimulating peripheral blood 
mononuclear cells (PBMCs)from each donor with a mixture of 
mutant BAX peptides (seq id nos 1,9-12) and mutant TGFpRII 
peptides (seq id nos 15-21). The concentration of each 
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cells were isolated from the cell product before DCs were 
derived using standard methods. 

FIG. 4: 

Figure 4 shows the capability of T cells obtained from 
ascites fluid of a pancreatic cancer patient to recognise 
and proliferate to different synthetic peptides derived 
from mutant BAX (seq id nos 1,9-12) and mutant TGFf3RII (seq 
id nos 15,17-21). The T cell line was obtained after 
expansion of T cells present in the ascites fluid of a 
patient with pancreatic adenocarcinoma. The T cell line was 
expanded in vitro by culturing with 100 U/ml recombinant 
interleukin-2 (rIL-2) (Amersham, Aylesbury, UK) for one 
week before beeing tested in a proliferation assay. 

Autologous, irradiated (3 0Gy) PBMCs were seeded 5 x 104 in 
u-bottomed 9 6 -well plates (Costar, Cambridge, MA) and 
pulsed with single synthetic peptides at 20 piM for 2h. The 
T cells were added 5 x 104 per well and the plates were 
incubated for four days at 37°C with addition of 18.5 x 104 
Bq/mL 3H- thymidine for the last 12 hours before harvesting. 
The plates were counted in a liquid scintillation counter 
(Packard Topcount) . Data represent specific proliferation 
to the different synthetic peptides and values are 
expressed as the mean of triplicate cultures. These results 
show that T cells isolated from a pancreatic cancer patient 
are capable of responding to a panel of peptides carrying 
amino acid sequences derived from mutant BAX and TGFPRII. 



30 FIG. 



Figure 5 further demonstrates the capability T cells from 
another pancreatic cancer patient to recognise and 
proliferate to different synthetic peptides derived from 
mutant BAX and mutant TGFPRII. The T cell line was obtained 
35 after expansion of T cells present in the ascites fluid of 
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individual peptide in the mixture was 2 0 pJA. After two 
weeks, and weekly thereafter, the bulk cultures were 
restimulated with autologous PBMCs pulsed with 10-25 \xM of 
the peptide mixture. After 4-5 restimulations the bulk 
cultures were tested in a standard proliferation assay with 
PBMCs alone or as a control or PBMCs pulsed with 2 5 JUM of 
the peptides as antigen presenting cells (APCs) . 

FIG. 2: 

It has further been found that T cell clones can be 
generated against separate peptides of the mixture used in 
the bulk stimulation experiments. Figure 2 shows the 
proliferation of T cell clone 521-2 which was obtained by 
cloning the bulk culture from donor 1 (figure 1) by seeding 
5 cells per well in U-bottomed, 96-well microtiter plates 
and using autologous PBMCs pulsed with 25 jjM of the mutant 
BAX peptide with seq id no 12 as feeder cells. Autologous 
B-lymphoblastoid cells were used as APCs in the 
proliferation assay. 

FIG. 3: 

In figure three it is shown that mutant BAX peptides and 
mutant TGFPRII peptides can be used to stimulate T cells 
(PBMCs) from a patient with breast cancer. Dendritic cells 
(DCs) from the same cancer patient were used as APCs. The T 
cell stimulation (figure 3) was obtained by pulsing DCs 
separately with a mixture of mutant BAX peptides (seq id 
nos 1,9-12) and a mixture of mutant TGFpRII peptides (seq 
id nos 15-21) followed by addition of autologous PBMCs and 
10 ng/ml tumour necrosis factor. The concentration of each 
peptide in the mixtures used for pulsing was 25 |1M. The 
PBMCs and the DCs were obtained by leukapheresis from a 
patient with breast cancer who had been on a granulocyte 
colony stimulating factor (G-CSF) treatment. The CD34+ 
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a patient with pancreatic adencarcinoma. The experiment was 
set up in the same way as described above. Data represent 
specific proliferation to the different synthetic peptides 
and values are expressed as the mean of triplicate 
5 cultures. 

In order to investigate the T cell response from the latter 
pancreatic cancer patient, responding T cells were cloned. 
Peritoneal macrophages were irradiated (30 Gy) and plated 1 

10 x 104 into U-bottomed 96-well plates (Costar) together with 
25 jxM of each peptide. T cell blasts were counted in a 
microscope and added 5 blasts per well together with 100 
U/ml human recombinant interleukin-2 (rIL-2) (Amersham, 
Aylesbury, UK) in a total volume of 200 mL . After 14 days T 

15 cell clones were transferred onto 24-well plates (Costar) 
with 1 mg/mL phytohemagglutinin (PHA, Wellcome, Dart ford, 
UK), 100 U/ml rIL-2 and allogeneic, irradiated PBMCs as 
feeder cells and screened for peptide specificity after 7 
and 14 days. 

20 

FIG. 6: 

T cell clone 520.5, 520.7 and 520.8 were selected for 
further characterisation and express the cell surface 
phenotype CD3+, CD8+ and TcR+ . Figure 6 shows the 

25 recognition and cytotoxicity of T cell clone 520.5, 520.7 
and 520.8 against peptide-pulsed autologous target cells 
pulsed with the seq id no 10 peptide. Autologous 
Epstein-barr virus transformed B-cells (EBV) were labelled 
with 3H-thymidine (9.25 x 104 Bq/ml) over night, washed 

30 once and plated 2500 cells per well in 96-well plates with 
or without 25 mM of synthetic peptide (seq id no 10) and 1% 
DMSO in medium. After 30 minutes incubation at 37°C the 
plates were washed before addition of T cells. The plates 
were further incubated at 37°C for 4 hours and then 

35 harvested before counting in a liquid scintillation counter 
(Packard Topcount) . Data represent percent specific lysis 
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of 3H-thymidine labelled peptide pulsed target cells at an 
effector/target ratio of 10/1. Values are expressed as the 
mean of triplicate cultures. These results demonstrate that 
the three different T cell clones obtained from ascites 
fluid of a pancreatic carcinoma patient, exhibit specific 
cytotoxicity of autologous EBV targets pulsed with the 
relevant peptide (seq id no 10) derived from mutant BAX. 

FIG. 7: 

Figure 7 shows the cytolytic properties of three different 
T cell clones obtained from the same patient. These T cell 
clones were cultured and expanded as described above, but 
they were generated against a synthetic peptide the seq id 
no 17 peptide carrying amino acid sequences derived from 
mutant TGFpRII . T cell clone 538.1, 538.3 and 538.4 all 
show the cell-surface phenotype CD3+, CD8+ and TcR+ . The 
experimental conditions were as described above (figure 6) . 
Data represent percent specific lysis of 3H- thymidine 
labelled peptide pulsed target cells pulsed with the seq id 
no 428 peptide at an effector/ target ratio of 10/1. Values 
are expressed as the mean of triplicate cultures. These 
results demonstrate that the three different T cell clones 
obtained from ascites fluid of a pancreatic carcinoma 
patient, exhibit specific cytotoxicity of autologous EBV 
targets pulsed with the relevant peptide (seq id no 428) 
derived from mutant TGFpRII. 

FIG. 8: 

Figure 8 shows the specificity of two CD4+ T cell clones, 
IMT8 and IMT9, obtained from a tumour biopsy taken from a 
patient with an adenocarcinoma localised to the proximal 
colon. Immunohistochemistry revealed that the patient had 
an abundant infiltrate of predominantly CD4+ T cells, many 
of which carried activation markers. In areas of CD4 T cell 
infiltration islands of HLA DR positive tumour cells were 
observed. The T cell clones were obtained from the 
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component of tumour infiltrating lymphocytes which grew out 
of the biopsy following culture in medium containing 15 
U/ml of recombinant human IL-2 for 16 days. The T cells 
from this culture were cloned by limiting dilution (1 
5 cells/well) in Terasaki plates with irradiated peptide 

pulsed APC and 100 U/ml of IL-2. Pulsing of autologous APC 
was performed with a mixture of the TGF(3rii frameshift 
peptides with sequence identity no. 15, 17 and 18 at 1 
fig /ml of each peptide in the presence of 3 fig/ml of 

10 purified human (32 microglobulin and lOng/ml of recombinant 
human TNFOt for 3 hrs at 37 °C . Of the 14 clones that could 
be expanded preliminary tests showed that two of the clones 
were reactive with the peptide mixture used for cloning. 
After expansion the clones were screened for reactivity 

15 with the single peptides in a standard proliferative assay. 
The results show that IMT8 and IMT9 both react specifically 
with the TGFpRII frameshift peptide with seq. id. no. 17, 
no reactivity was observed with the two other frameshift 
peptides tested. 

20 

The figure (Fig. 8) depicts the results of conventional T 
cell proliferative assays, where cloned T cells (5xl0 4 ) and 
irradiated APC (5xl0 4 ) were cocultured for 3 days in 
triplicates before harvesting. To measure the proliferative 
25 capacity of the cultures, 3 H- thymidine (3,7xl0 4 Bq/well) 
was added to the culture overnight before harvesting) 
Values are given as mean counts per minute (cpm) of the 
triplicates . 

30 FIG. 9: 

Figure 9 demonstrates that the specific reactivity of the 
two T cell clones IMT8 and IMT9 against the peptide with 
seq. id.no. 17 is completely blocked by treatment of the 
cells with an antibody that specifically binds to HLA-DR 
35 molecules, since the reactivity after blocking is the same 
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as the background reactivity of the clones with APC in the 
absence of the peptide. On the other hand antibodies to the 
HLA class II isotypes HLA-DQ and -DP failed to block the 
reactivity of the clones with peptide pulsed APC. This 
experiment unequivocally identifies HLA-DR as the molecule 
responsible to present the peptide to these two T cell 
clones. Antibody blocking experiments were performed using 
the homozygous EBV transformed cell line 9061 (IHWS9 
nomenclature) as APC. The APC were pulsed with peptide at a 
concentration of 15 jig/ml for 1 hr.at 37 °C before addition 
of blocking antibodies L243 (pan-DR antibody) , SPVL3 
(pan-DQ antibody) and B7.21 (pan-DP antibody) at 10 ug/ml . 
Unpulsed APC and APC pulsed with peptide in the absence of 
blocking antibody served as negative and positive controls 
respectively. Results are expressed as in Figure 8. 

FIG. 10: 

The patient IMT was HLA typed and turned out to be HLA: 
Al,2; B7,8; DR3 , 14 ; DQ1,2. To determine which of the HLA-DR 
molecules that were responsible for presentation of the 
peptide with seq. id. no. 17, a panel of HLA workshop 
derived homozygous BCL cell lines were obtained and pulsed 
with the peptide with seq. id. no. 17. Figure 10 describes 
the identification of HLA-DR14 (DRA*0102, DRB*1401) as the 
HLA-DR molecule responsible for presentation of the peptide 
with seq. id. no. 17 to the T cell clones IMT8 and IMT9 . A 
specific proliferative response was observed when peptide 
was presented by the autologous EBV transformed cell line 
(Auto APC) and by cell lines 9054 (EK) and 9061 (31227ABO) , 
both of which expressed DR14 as the only DR molecule on 
their surface. The homozygous cell line gave higher 
responses, reflecting a higher level of expression of the 
relevant class II/peptide complexes due to the effect of a 
double dose of the genes encoding this DR molecule. No 
response was obtained when the peptide was presented by 
cell lines expressing HLA-DR3 (9018, L0081785) , which 
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represents the other DR molecule expressed by the patients 
APC, nor by irrelevant HLA-DR molecules. The experiment was 
performed as described in figure 9, with the exception that 
no antibody blocking was performed. Results are expressed 
as in Figure 8. 

FIG. 11: 

Figure 11 describes the dose response curves obtained by 
pulsing the cell line 9054 with increasing concentrations 
of the peptide with seq. id. no. 17. Both IMT 8 and IMT9 
demonstrate a dose dependent increase in the proliferative 
response to the peptide. Results were performed as 
described in Fig. 9 and 10 with the peptide concentrations 
indicated on the Figure (Fig. 11). Results are expressed as 
in Fig. 8. 

FIG. 12: 

Figure 12 describes the reactivity of a cell line generated 
by in vitro stimulation of T cells isolated from peripheral 
blood from a healthy blood donor (Donor 2892) by weekly 
stimulation with irradiated autologous dendritic cells 
pulsed with the peptides with sequence identity numbers 16, 
17 and 21. A specific response above background values was 
obtained when the T cells were co- incubated with autologous 
dendritic cells pulsed with the peptide with seq. id. no. 
21. No activity could be detected in the culture after the 
first and second in vitro stimulation. These data 
demonstrate that the T cell repertoire of normal 
individuals contain a few precursor cells that have the 
capacity to recognise this frameshift peptide derived from 
a mutation in TGFPrii that does not occur in normal people. 
In two other blood donors (#2706 and #2896), the level of 
precursor cells with the relevant specificity was too low 
to be detected. The results are expressed as spots per 10 4 
T cells tested in a conventional IFNg ELISPOT assay. This 
assay enumerates the number of cells present in a mixture 
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of cells that are capable of specifically reacting with a 
defined antigen. Briefly 10 7 T cells (non adherent cells) 
were stimulated weekly with 2-5xl0 6 irradiated peptide 
pulsed autologous dendritic cells (DC) as APC. The DC were 
generated from the adherent cell population by culture for 
one week in recombinant human GM-CSF and IL-4 according to 
standard protocols as described in the literature. After 
peptide pulsing overnight at 15 ug/ml of peptide, full 
maturation of the DC was obtained by culture with 
recombinant TNFoc. ELI SPOT was performed according to 
standard published protocols using 10 4 cultured T cells per 
well in duplicate and 10 4 peptide pulsed or unpulsed DC as 
APC. The results are expressed as mean number of spots per 
10 4 T cells. 



FIG. 13: 

Figure 13 shows the results of in vitro stimulation of T 
cells from a healthy blood donor (Donor 322) with peptides 
with sequence identity number 15-21. In vitro culture was 
performed as described in Figure 12. A proliferative 
response above background values was seen when the T cell 
culture primed with a mixture of the peptides with seq. 
id. no. 16 and 21 was stimulated with peptide 21 and the 
culture primed with the peptide with seq. id. no. 17 was 
stimulated with the same peptide. These results demonstrate 
that normal blood donors have small numbers of circulating 
T cells specific for these frameshift peptides, and that it 
is possible to expand these cells in culture by stimulation 
with frameshift peptides. These results also confirmed the 
results shown in Fig. 8-11, demonstrating that the peptide 
with seq. id. no. 17 is immunogenic in humans, and indicate 
that the peptide with seq. id. no. 21 may also be used as a 
cancer vaccine in humans. The results are expressed as 
described in Fig. 8. 
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FIG. 14: 

The results shown in Figure 14 demonstrate that CD8 + T 
cells specific for HLA class I epitopes can be generated 
from T cells present in the T cell repertoire of a healthy 
5 blood donor (donor 905) . No reactivity above background was 
seen with any of the peptides after the second round of in 
vitro restimulation. After the fourth restimulation, the 
frequency of T cells specif ic for the peptide with seq. id. 
no. 428 had increased from undetectable levels to 

10 approximately 2,5 % of the cells. These results demonstrate 
that CTL precursors of the CD8+ phenotype are present in 
the unprimed T cell repertoire of healthy blood donors. 
Such T cells may be expanded in vitro by specific 
stimulation with the peptide with seq. id. no. 428. This 

15 forms the basis for using this peptide as a cancer vaccine 
to elicit cytotoxic T cells specific for framshift peptides 
in cancer patient having such mutations. T cells were 
generated by weekly restimulation of T cells isolated from 
peripheral blood and stimulated with peptide pulsed 

20 autologous DC as described in Fig. 12, with the exception 
that 11-7 and 11-2 was added during culture according to 
standard procedures for generating cytotoxic T cells (CTL) 
of the CD8 phenotype. The peptides used were peptides with 
sequence identity number 428, 439, 446 and 451. Cells were 

25 tested in ELISPOT assay as described in Fig. 12. The results 
are expressed as described in figure 12 . 



30 The peptide with seq. id. no. 17 was selected and designed 
to contain binding motifs for both several HLA class I and 
HLA class II molecules. These peptides thus contains 
epitopes both for CD4+ and CD8+ T cells, and was predicted 
to elicit both CD4 and CD8 T cell responses in cancer 

35 patient provided processing of the aberrant TGF(3RII protein 
naturally occurring in cancer cells would take place and 
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result in an overlapping peptide. This has now been proven 
for CD4 T cells by the results in Fig. 8-11. These results 
have the following implication: 

1) The results in Figure 8 prove that the mutated form of 
TGFPRII Receptor which occurs in a high proportion of 
cancer patients with defects in their mismatch repair 
machinery is a tumour specific antigen. 



10 2) The antigen specificity of the infiltrating T cells 

commonly observed in colorectal cancer are generally not 
known. The results in figure 8 demonstrate that one 
component of the T cells constituting the population of 
tumour infiltrating lymphocytes in this patients tumour is 

15 specific for a frameshift mutation, demonstrating that 
TGF0RII frameshift peptides are immunogenic in vivo, 
occasionally giving rise to spontaneous T cell activation. 



20 



25 



30 



35 



3 )It follows from this observation that processing of the 
non-functional form of the TGF(3RII Receptor that is formed 
by the common frameshift mutation is processed. This 
processing may take place either in the tumour cell as part 
of natural breakdown of the aberrant protein, or after the 
tumour cell itself or a released form of the receptor has 
been taken up by a professional APC or both. 

4) The results in Figure 8 also indicate that the peptide 
with seq. id. no. 17 is capable of binding to an HLA class 
II molecule, since pulsing of APC with this peptide results 
in a specific proliferative response against the peptide, 
and since CD4 T cell responses always are class II 
restricted. That this is the case is demonstrated by the 
results of the experiment shown in Figure 9. Here it is 
shown that the specific response against the peptide with 
seq. id. no. 17 is completely blocked by an antibody to 
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HLA-DR, but not with antibodies to the two other HLA class 
II molecules, HLA-DQ and -DP. Furthermore, by using a panel 
of standard homozygous Epstein Barr Virus (EBV) transformed 
B Cell Lines (BCL) covering the relevant HLA class II 
5 molecules present on the patients own APC, we were able to 
identify the class II molecule responsible for presentation 
of the peptide with seq. id. no. 17 to TLC IMT8 and IMT9 as 
being HLA-DR 14. Together these findings fit extremely 
well with the immunohistological observations made in 

10 parallel sections taken from the same tumour biopsy, where 
we could show that activated CD4+ T cells were abundant in 
the proximity of tumour cells that had been induced to 
express HLA-DR. molecules. The results in Figure 11 
demonstrate that these T cell clones are capable of 

15 mounting a proliferative response over a range of peptide 
doses ,and that the responses are dose dependent. 

5) Since these T cell clones were obtained by cloning T 
cells isolated from a tumour biopsy, another implication of 

20 our finding is that activated T cells specific for the 

peptide with seq. id. no. 17 are capable of homing to the 
tumour tissue after activation. 

6) Since the peptide with seq. id. no. 17 is a tumour 
25 specific antigen, and since frameshift mutations giving 

rise to this peptide or peptides with overlapping sequences 
are commonly found in cancers with defects in enzymes that 
are part of the mismatch repair machinery, this peptide may 
be used as a vaccine to elicit T cell response in cancer 

30 patients or patients at high risk for developing cancer. 

Such T cell responses may potentially influence the growth 
of an existing tumour or prohibit regrowth of tumour after 
surgery and other forms of treatment or be given to 
patients with an inheritable form of cancer where a defect 

35 mismatch enzyme is detected or suspected and that have a 
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high chance of developing a cancer where this precise 
mismatch repair mutation will occur. 



5 

Synthesis 

The peptides were synthesised by using continuous flow 
solid phase peptide synthesis. N-a-Fmoc-amino acids with 

10 appropriate side chain protection were used. The 
Fmoc-amino acids were activated for coupling as 
pentaf luorophenyl esters or by using either TBTU or 
diisopropyl carbodiimide activation prior to coupling. 
20% piperidine in DMF was used for selective removal of 

15 Fmoc after each coupling. Cleavage from the resin and 

final removal of side chain protection was performed by 
95% TFA containing appropriate scavengers. The peptides 
were purified and analysed by reversed phase (C18) HPLC. 
The identity of the peptides was confirmed by using 

20 electro-spray mass spectroscopy (Finnigan mat SSQ710) . 

The peptides used for in vitro studies of T cell 
stimulation were synthesised by this method. 

25 Several other well known methods can be applied by a person 
skilled in the art to synthesise the peptides. 

Examples of the method for determining new frameshift 
30 mutation peptides . 

In this Example, the BAX gene is used to illustrate the 
principle. 

35 In each of the steps listed below, the 1st line is the gene 
sequence and 2nd line is amino acid sequence. 
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In the steps 2-5, the outlined sequences represent the 
mutant part of the protein. 



5 Step one: 

Normal BAX. 

A TG GGG GGG GA G GCA CCC GAG CTG GCC CTG GAC CCG GTG 
0 MGGEAPELALDPV.. 



Step two : 

15 

1G deleted from gene sequence. 

ATG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC CTC 
M G G RHPSWPWTRC 

20 

AGG ATG CGT CCA CCA AGA AGC TGA 
R M R P p R s stop 

25 

Step three: 

2G deleted from gene sequence. 

30 

ATG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
M G G GTRAGPGPGA 

TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
SGCVHQ EAERVSQA 



CAT CGG GGA CGA ACT GGA CAG TAA 
H R G R T G Q stop 
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Step four: 



1G inserted in gene sequence. 

ATG GGG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
MGGGGTRAGPGPGA 



10 



TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
SGCVHQEAERVSQA 

CAT CGG GGA CGA ACT GGA CAG TAA 
H R G R T G Q Stop 



15 



20 



25 



Step five: 

2G inserted in gene sequence. 

ATG GGG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC 
MGGGRHPSWPWTRC 

CTC AGG ATG CGT CCA CCA AGA AGC TGA 
LRMRPPRS Stop 



30 



In the next Example, the TGF(3RII gene is used to illustrate 
the principle. 

In each of the steps listed below, the 1st line is the gene 
sequence and 2nd line is amino acid sequence. 
In the steps 2-5, the outlined sequences represent the 
mutant part of the protein. 



35 



40 



Step one: 
Normal TGF0RII. 

GAA AAA AAA AA G CCT GGT GAG ACT TTC TTC ATG TGT TCC . . . 
EKKKPGETFFMCS. 



45 
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Step two: 

1A deleted from gene sequence. 

5 GAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT GTA 

EKKSLVRLSSCVPV 

GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG AAT 
ALMSAMTTSSSQKN 

0 

ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 
ITP AILTCC stop 



15 

Step three: 

2A deleted from gene sequence. 

20 GAA AAA AAA GCC TGG TGA 

E K K A W Stop 



25 Step four: 

1A inserted in gene sequence. 

GAA AAA AAA AAA GCC TGG TGA 
30 E K K K A W stop 



Step five: 

35 

2A inserted in gene sequence. 

GAA AAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT 
EKKKSLVRLSSCVP 

40 

GTA GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG 
VALMSAMTTSSSQK 

AAT ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 
45 N ITPAILTCC stop 



50 



Thus the peptides of the invention may be used in a 
method for the treatment of cancers with cancer cells 
harbouring genes with frameshift mutations, which 
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treatment comprises administering at least one peptide of 
the present invention in vivo or ex vivo to a human 
patient in need of such treatment. 



5 In another embodiment the peptides of the invention may 
be used to vaccinate a human being disposed for cancers 
with cancer cells harbouring genes with frameshift 
mutations, by administering at least one peptide of the 
present invention to said human being. 

10 

It is further considered to be an advantage to administer 
to a human individual a mixture of the peptides of this 
invention, whereby each of the peptides of the invention 
can bind to different types of HLA class I and/or class 
15 II molecules of the individual. 



It is further anticipated that the power of an anticancer 
vaccine or peptide drug as disclosed in the above 
mentioned PCT/NO92/00032 application, can be greatly 

20 enhanced if the peptides of the present invention were 
included. Thus in another embodiment of the present 
invention peptides of the present invention are 
administered together with, either simultaneously or in 
optional sequence, with the peptides disclosed in 

25 PCT/NO92/00032. 

It is considered that the peptides may be administered 
together, either simultaneously or separately, with 
compounds such as cytokines and/or growth factors, i.e. 
30 interleukin-2 (IL-2), interleukin-12 (IL-12), granulocyte 
macrophage colony stimulating factor (GM-CSF) , Flt-3 
ligand or the like in order to strengthen the immune 
response as known in the art . 



35 



The peptides according to the present invention can be 
used in a vaccine or a therapeutical composition either 
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alone or in combination with other materials, such as for 

instance standard adjuvants or in the form of a 

lipopeptide conjugate which as known in the art can 

induce high-affinity cytotoxic T lymphocytes, (K. Deres, 
5 Nature, Vol.342, (nov.1989)). 

The peptides according to the present invention may be 
useful to include in either a peptide or recombinant 
fragment based vaccine. 

10 

The peptides according to the present invention can be 
included in pharmaceutical compositions or in vaccines 
together with usual additives, diluents, stabilisers or 
the like as known in the art. 

15 

According to this invention, a pharmaceutical composition 
or vaccine may include the peptides alone or in 
combination with at least one pharmaceutically acceptable 
carrier or diluent. 

20 

Further a vaccine or therapeutical composition can 
comprise a selection of peptides which are fragments of 
the mutant proteins arising from insertion or deletion of 
bases in a repeat sequence of the gene. 

25 

Further a vaccine composition can comprise at least one 
peptide selected for one cancer, which vaccine would be 
administered to a person carrying a genetic disposition 
for this particular cancer. 

30 

Further a vaccine composition can comprise at least one 
peptide selected for one cancer, which vaccine would be 
administered to a person belonging to a high risk group 
for this particular cancer. 



35 
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The cancer vaccine according to this invention may 
further be administered to the population in general for 
example as a mixture of peptides giving rise to T cell 
immunity against various common cancers connected with 
frameshift mutation genes. 



10 



15 



20 



25 



30 



The peptides according to this invention may be 
administered as single peptides or as a mixture of 
peptides. Alternatively the peptides may be covalently 
linked with each other to form larger polypeptides or 
even cyclic polypeptides. 

A cancer therapy according to the present invention may 
be administered both in vivo or ex vivo having as the 
main goal the raising of specific T cell lines or clones 
against the mutant gene product associated with the 
cancer type with which the patient is afflicted. 

Further, the frameshift mutant peptides of this invention 
may be administered to a patient by various routes 
including but not limited to subcutaneous, intramuscular, 
intradermal, intraperitoneal, intravenous or the like. In 
one embodiment the peptides of this invention are 
administered intradermally . The peptides may be 
administered at single or multiple injection sites to a 
patient in a therapeutically or prophylactically 
effective amount. 

The peptides of this invention may be administered only 
once or alternatively several times, for instance once a 
week over a period of 1-2 months with a repeated sequence 
later all according to the need of the patient being 
treated . 



35 



The peptides of this invention can be administered in an 
amount in the range of 1 microgram (1 jig) to 1 gram (lg) 
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to an average human patient or individual to be 
vaccinated. It is preferred to use a smaller dose in the 
rage of 1 microgram (1 [ig) to 1 milligram (1 mg) for each 
administration . 

5 

The invention further encompasses DNA sequences which 
encodes a frameshift mutation peptide. 

The invention additionally encompasses isolated DNA 
10 sequences comprising a DNA sequence encoding at least one 
frameshift mutant peptide, and administration of such 
isolated DNA sequences as a vaccine for treatment or 
prophylaxis of cancers associated with frameshift 
mutations in the genes . 

15 

The peptides according to this invention may be 
administered to an individual in the form of DNA vaccines. 
The DNA encoding these peptides may be in the form of 
cloned plasmid DNA or synthetic oligonucleotide. The DNA 

20 may be delivered together with cytokines, such as IL-2, 
and/or other co-stimulatory molecules. The cytokines 
and/or co-stimulatory molecules may themselves be delivered 
in the form of plasmid or oligonucleotide DNA. The response 
to a DNA vaccine has been shown to be increased by the 

25 presence of immunostimulatory DNA sequences (ISS) . These 

can take the form of hexameric motifs containing methylated 
CpG, according to the formula: 

5 ' -purine-purine-CG-pyrimidine-pyrimidine-3 ' . Our DNA 
vaccines may therefore incorporate these or other ISS, in 
30 the DNA encoding the peptides, in the DNA encoding the 

cytokine or other co-stimulatory molecules, or in both. A 
review of the advantages of DNA vaccination is provided by 
Tighe et al (1998, Immunology Today, 19(2), 89-97). 

35 In one embodiment, the DNA sequence encoding the mutant 
BAX peptides comprises : 
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PCT/NO99/00143 



10 



15 



20 



25 



30 



35 



40 



45 



Normal BAX. 

ATG GGG GGG GAG GCA CCC GAG CTG GCC CTG GAC CCG GTG 



1G deleted from BAX gene sequence. 

ATG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC CTC 
AGG ATG CGT CCA CCA AGA AGC TGA 

2G deleted from BAX gene sequence. 

ATG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
CAT CGG GGA CGA ACT GGA CAG TAA 

1G inserted in BAX gene sequence. 

ATG GGG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
CAT CGG GGA CGA ACT GGA CAG TAA 

2G inserted in BAX gene sequence. 

ATG GGG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC 
CTC AGG ATG CGT CCA CCA AGA AGC TGA 



In a second embodiment, the DNA sequence encoding the 
mutant TGF0RII peptides comprises: 
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Normal TGF^RII gene. 

GAA AAA AAA AA G CCT GGT GAG ACT TTC TTC ATG TGT TCC. . . . 

5 

1A deleted f rom TGF(3RII gene sequence . 

GAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT GTA 

10 

GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG AAT 
ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 

15 

2A deleted from TGF|3RII gene sequence. 
GAA AAA AAA GCC TGG TGA 

20 

1A inserted in TGF|iRII gene sequence. 

25 

GAA AAA AAA AAA GCC TGG TGA 



30 

2A inserted in TGFpRII gene sequence . 

GAA AAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT 
35 GTA GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG 

AAT ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 

40 

The invention further encompasses vectors and plasmids 
comprising a DNA sequence encoding a frameshift mutant 
peptide. The vectors include, but are not limited to 
E.Coli plasmid, a Listeria vector and recombinant viral 
45 vectors. Recombinant viral vectors include, but are not 
limited to orthopox virus, canary virus, capripox virus, 
suipox virus, vaccinia, baculovirus, human adenovirus, 



10 
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SV40, bovine papilloma virus and the like comprising the 
DNA sequence encoding a frameshift mutant peptide. 

It is considered that an anticancer treatment or 
prophylaxis may be achieved also through the 
administration of an effective amount of a recombinant 
virus vector or plasmid comprising at least one insertion 
site containing a DNA sequence encoding a frameshift 
mutant peptide to a patient, whereby the patient's 
antigen presenting cells are turned into host cells for 
the vector/plasmid and presemtation of HLA/ frameshift 
mutation peptide complex is achieved. 

A person skilled in the art will find other possible use 
15 combinations with the peptides of this invention, and 

these are meant to be encompassed by the present claim. 

The peptides according to this invention may be produced 
by conventional processes as known in the art, such as 
chemical peptide synthesis, recombinant DNA technology or 
protease cleavage of a protein or peptide encoded by a 
frameshift mutated gene. One method for chemical 
synthesis is elucidated in the description below. 

In order for a cancer vaccine and methods for specific 
cancer therapy based on specific T cell immunity to be 
effective, three conditions must be met: 

1. The peptides used must correspond, either in their 
full length or after processing by antigen presenting 
cells, to the processed mutant protein fragment as 
presented by a HLA Class I and/or class II molecule on 
the cancer cell or other antigen presenting cells, 

2. The peptides used must be bound to a HLA Class I 
and/or Class II molecule in an immunogenic form, and 
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3. T-cells capable of recognising and responding to the 
HLA/peptide complex must be present in the circulation of 
the human being. 



5 It has been established that all these conditions are met 
for some representative peptides according to the present 
invention. The peptides according to the present 
invention give rise to specific T cell immune responses 
in vitro. It has been established that the peptides 
10 according to this invention correspond to processed 
mutant protein fragments. This is exemplified with 
peptides corresponding to fragments of transformed mutant 
BAX and TGFPRII peptides. 

15 Through the present invention the following advantages 
are achieved: 

- It offers a possibility to treat patients suffering 
from cancers arising from frame-shift mutations in their 
genes, most of which cancers known at present do not have 

20 any good treatment alternatives. 

- It offers a possibility to vaccinate prophylaxtically 
humans carrying genetic dispositions or belonging to 
other high risk groups. 

- It offers a possibility to prepare a combination 

25 treatment for a specific cancer, such as for instance 
colorectal or pancreatic cancers, wherein the cancer 
commonly is associated with either a frameshift mutation 
or a point mutation in the genes. 

-Since described frameshift mutations occurs in a large 
30 variety of cancers it will be possible to use this 

peptides in combination with established vaccines and 
future vaccines to obtain a multiple targetting 
treatment . 

-Likewise patients suffering from cancers associated with 
35 multiple frameshift mutations in genes can be treated 
more efficiently through a combination treatment. 



